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Improved phytases 

Phytases are enzymes that hydrolyze phytate (myo-inositol 
hexakisphosphate) to myo-inositol and inorganic phosphate. They 
5 are known to be valuable feed additives. 

The present invention relates to improved phytases, viz. 
phytases of amended characteristics, e.g. amended activity 
characteristics, reference being made to e.g. the phytase(s) it 
has been derived from, or to known phytases. Amended activity 
10 characteristics means amended in at least one phytase activity 
related respect , such as (non-exclusive list) : pH stability, 
temperature stability, pH profile, temperature profile, specific 
activity (in particular in relation to pH and temperature) , 
substrate specificity, substrate cleavage pattern, substrate 
15 binding, position specificity, the velocity and level of release 
of phosphate from corn, reaction rate, phytate degradation 
rate) , end level of released phosphate reached. 

Examples of amended activity characteristics are amended 
specific activity (e.g. increased, e.g. increased at a pH of 3, 
20 4, 5, or 6); amended pH or temperature profile; and/or amended 
(e.g. increased) thermostability, e.g. of an increased melting 
temperature as measured using Differential Scanning Calorimetry 
(DSC) . 

The present invention also relates to a process for the 
25 preparation of a modified protein, wherein in a first step a 
consensus sequence is determined from a number of highly 
homologous sequences according to steps a) , b) and c) below: 

a) at least three, preferably at least four amino acid 
sequences are aligned by any standard alignment program known in 
3 0 the art; 



5808 . 200-US/l40100/KaPe 



2 

b) at every position of the amino acid sequence alignment, 
the amino acids are evaluated for their evolutionary similarity 
and a consensus residue is chosen by any standard program known 
in the art, whereby the minimal requirements for calculation of 

5 a consensus residue are set in such a way that the program is 
already able to determine a consensus residue if a given residue 
occurs in only two of the aligned sequences. However, if there 
is a subgroup of sequences among the compared amino acid 
sequences that shows a much higher degree of similarity with 

10 each other than with the remaining sequences of the alignment, 
the subgroup may be represented in the calculation only with its 
consensus sequence determined in the same way as outlined in 
EP 897985, or alternatively, to each sequence of the subgroup, a 
vote weight of 1 divided by the number of sequences in the 

15 subgroup will be assigned; 

c) in case no consensus amino acid at a defined position 
is identified by the program, any of the amino acids, preferably 
the most frequently occurring amino acid at this position is 
selected. 

20 In a second aspect of the invention, a homologous sequence 

is compared with the consensus sequence, and one or more non- 
consensus residues in this homologous sequence are replaced by 
the corresponding consensus residues. 

Preferably, only such amino acid residues are replaced in 

25 the homologous amino acid sequence where a consensus residue can 
clearly be defined by the program under moderately stringent 
conditions whereas at all positions of the alignment where no 
preferred consensus amino acid can be determined under 
moderately stringent conditions, the amino acids of the 

3 0 homologous protein remain unchanged. 
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In a third aspect of the invention, the active center of 
the protein of interest is determined, comprising all amino acid 
residues that are involved in forming the active center, both in 
the consensus sequence, and in the sequence of a homologous 
protein; subsequently, some or all of the divergent amino acid 
residues of the homologous protein are inserted in the backbone 
of the consensus sequence . 

In one embodiment of this process, the program used for 
the comparison of amino acids at a defined position regarding 
their evolutionary similarity is the program "PRETTY". 

The active center of the protein can be determined by 
using an analysis of the three-dimensional structure of the 
protein. 

An example of a homologous protein is an enzyme family, an 
example of a defined protein family is the family of phytases, 
e.g. of fungal origin. 

For example, the amino acid sequence of the phytase can be 
changed by the introduction of at least one mutation or 
substitution chosen from 



E58A 


F54Y 


D69K 


I73V 


D197N 


K94A 


T214L 


R101A 


E222T 


N153K 


E267D 


V158I 


R291I 


A203G 


R329H 


S205G 


S364T 


V217A 


A379K 


A22 7V 


G4 04A 


V234L 




P238A 
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Q277E 
A2 87H 
A2 92Q 
V366I 

5 A396S 

E415Q 
G437A 
R451E 

For interpreting these abbreviations, as an example, the 
10 mutation E58A is to be interpreted as follows: When subtracting 
2 6 from the number, you get the position or residue number in 
the consensus phytase sequence or another phytase sequence 
aligned as shown in Fig. 1 (corresponding to the addition of a 
2 6 amino acid signal sequence to the sequences shown in Fig. 1) . 
15 For example, in E58A, number 58 means position number 32 (58- 
26=32) . And the letter before the number, i.e. E, represents the 
amino acid in the phytase to be modified which is replaced by 
the amino acid behind the number, i.e. A. 

The above-mentioned amino acid replacements, alone and/or 
20 in combination, have a positive effect on the protein stability. 

The following sub-groups of mutations are also interesting 
(i.e. phytases comprising at least one mutation selected from 
either one of the groups of) : 

E58A, D69K, D197N, T214L, E222T, E267D, R291I, R329H, 
25 S364T, A379K, G404A; 

F54Y, I73V, K94A, R101A, N153K, V158I, A203G, S205G, 
V217A, A227V, V234L, P238A, Q277E, A287H, A292Q, V366I, A396S, 
E415Q, G437A, R451E; 

E58A, D69K, D197N, F54Y, I73V, K94A; 
30 T214L, E222T, E267DR101A, N153K, V158I; 

R291I, R329H, S364TA203G, S205G, V217A; 
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A379K, G404AA227V, V234L, P238A, Q277E; 

A287H, A292Q, V366I, A396S, E415Q, G437A, R451E; 

T214L, E222T, S364T, V158I, A203G, G404A, A227V, P238A, 
A396S, G437A, R451E. 
5 Examples of host cells are plant cells, animal cells, and 

microbial cells, e.g. prokaryotic or eukaryotic cells, such as 
bacterial, fungal or yeast cells. An example of a fungal host is 
a strain of the genus Aspergillus, and examples of yeast hosts 
are strains of Saccharomyces , and strains of Hansenula. 
10 The invention also relates to a modified protein 

obtainable or obtained by any of the processes described above. 

The invention also relates to a variant or mutein of a 
phytase such as (but not limited to) the consensus phytase-1, 
wherein, in the amino acid sequence in Figure 2, at least one 
15 of the following replacements have been effected: Q5 0L, Q50T, 
Q50G, Q50T-Y51N, Q50L-Y51N or Q50T-K91A. 

In the third aspect mentioned above, a consensus sequence 
is determined from homologous sequences as described above; in a 
second step the active center of the protein comprising all 
20 amino acid residues that are involved in forming the active 
center is determined in the consensus sequence and in the 
sequence of a single homologous protein as well. The single 
homologous protein may have preferred properties like high 
specific activity or different pH dependency of enzymatic 
25 activity. In a third step some or all amino acid residues that 
are involved in forming the active center of the homologous 
protein are inserted into the backbone of the consensus 
sequence. The result thereof is a chimeric protein having the 
active center derived from a single protein and the backbone of 
30 the consensus sequence. 
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The active center of the protein can be determined e.g. by 
using any analysis of the three-dimensional structure of the 
protein, e.g. by homology modelling on the basis of a known 3D- 
structure of a known protein. 
5 The present invention also provides consensus proteins 

obtainable or obtained by such processes, in particular proteins 
comprising at least one of the amino acid sequences shown in 
Figures 2-6, 10 or 21, or variants or muteins thereof. Examples 
of such variants are shown in Figs. 7-9. 

10 Such variants or muteins can be defined and prepared on 

the basis of the teachings given in European Patent Application 
number 0897010, e.g. Q50L, Q50T, Q50G, Q50L-Y51N, or Q50T-Y51N. 
These mutations are defined as above, or, alternatively, by 
reference to Fig. 2. When referring to Fig. 2, no subtraction of 

is the 26 amino acid signal peptide is required (e.g. in "Q50L, " at 
position 5 0 of the amino acid sequence of Fig. 2, the amino acid 
Q has been replaced by amino acid L) . 

A food, feed, or pharmaceutical composition comprising the 
phytases of the invention is another aspect of the invention. 

20 In this context, and relating to the process of the 

invention, "at least three, preferably at least four amino acid 
sequences of such defined protein family" means that three, 
four, five, six to twelve, twenty, fifty, or even more sequences 
can be used for the alignment and the comparison to create the 

25 amino acid sequence of the consensus protein. "Sequences of a 
defined protein family" means that such sequences fold into a 
three-dimensional structure, wherein the alpha-helices, the 
beta-sheets and beta-turns are at the same position so that such 
structures are, as called by the man skilled in the art, largely 

30 superimposable. Furthermore these sequences characterize 
proteins that show the same type of biological activity, e.g. a 
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defined enzyme class, e.g. the phytases. The three-dimensional 
structure of one such protein is sufficient to allow the 
modelling of the structure of the other homologous proteins of 
such a family. An example, how this can be done, is given in 
5 Example 1. "Evolutionary similarity" in the context of the 
present invention refers to a scheme which classifies amino 
acids regarding their structural similarity which allows that 
one amino acid can be replaced by another amino acid with a 
minimal influence on the overall structure, as this is done e.g. 

10 by programs, like "PRETTY", known in the art. The phrase "the 
degree of similarity provided by such a program... is set to less 
stringent number" means in the context of the present invention 
that values for the parameters which determine the degree of 
similarity in the program used in the practice of the present 

is invention are chosen in a way to allow the program to define a 
consensus amino acid for a maximum of positions of the whole 
amino acid sequence, e. g. in case of the program PRETTY a value 
of 2 or 3 for the THRESHOLD and a value of 2 for the PLURALITY 
can be chosen. Furthermore, "a vote weight of one divided by the 

20 number of such sequences" means in the context of the present 
invention that the sequences which define a group of sequences 
with a higher degree of similarity as the other sequences used 
for the determination of the consensus sequence only contribute 
to such determination with a factor which is equal to one 

25 divided by the number of all sequences of this group. 

As mentioned before, should the program not allow to 
select the consensus amino acid, the most frequent amino acid is 
selected; should the latter be impossible the man skilled in the 
art will select an amino acid from all the sequences used for 

30 the comparison which is known in the art for its property to 
improve the thermostability in proteins as discussed e.q. by 
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Janecek, S. (1993), Process Biochem. 28, 435-445; Fersht, A. R. 
& Serrano, L. (1993), Curr. Opin. Struct. Biol. 3, 75-83; Alber, 
T. (1989), Annu. Rev. Biochem. 58, 765-798; Matthews, B. W. 
(1987), Biochemistry 26, 6885-6888; or Matthews, B. W. (1991), 
5 Curr. Opin. Struct. Biol. 1, 17-21. 

The stability of an enzyme is relevant for many industrial 
applications. Therefore, a lot of attempts, more or less 
successful, have been made to improve the stability, preferably 
the thermostability of enzymes by rational or random approaches. 

io Here we present an alternative way to improve the 

thermostability of a protein. 

The invention provides a process for the preparation of a 
consensus protein comprising a process to calculate an amino 
acid residue for nearly all positions of a so-called consensus 

15 protein and to synthesize a complete gene from this sequence 
that can be expressed in a pro- or eukaryotic expression system. 

DNA sequences of the present invention can be constructed 
starting from genomic or cDNA sequences encoding the proteins, 
e.g. phytases, of interest. For example, they can be constructed 

20 by methods of in vitro mutagenesis [see e.g. Sambrook et al . , 
Molecular Cloning, Cold Spring Harbor Laboratory Press, New 
York]. A widely used strategy for "site-directed mutagenesis", 
as originally outlined by Hurchinson and Edgell [J. Virol. 8, 
181 (1971)], involves the annealing of a synthetic 

25 oligonucleotide carrying the desired nucleotide substitution to 
a target region of a single-stranded DNA sequence wherein the 
mutation should be introduced [for review see Smith, Annu. Rev. 
Genet. 19, 423 (1985), and for improved methods, see references 
2-6 in Stanssen et al . , Nucl. Acids Res., 17, 4441-4454 (1989). 

30 Another possibility of mutating a given DNA sequence is the 
mutagenesis by using the polymerase chain reaction (PCR) . DNA as 
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starting material can be isolated by methods known in the art 
and described e.g. in Sambrook et al . (Molecular Cloning) from 
the respective strains. 

For strain information, see e.g. EP 684313 or any 
5 depository authority indicated below. Aspergillus niger [ATCC 
9142], Myceliophthora thermophila [ATCC 48102], Talaromyces 
thermophilus [ATCC 20186] and Aspergillus fumigatus [ATCC 34625] 
have been redeposited according to the conditions of the 
Budapest Treaty at the American Type Culture Cell Collection 

10 under the following accession numbers : ATCC 7433 7 , ATCC 7434 0 , 
ATCC 74338 and ATCC 74339, respectively. It is, however, 
understood that DNA encoding a consensus protein in accordance 
with the present invention can also be prepared in a synthetic 
manner as described, e.g. in EP 747483 or EP 897985, or in the 

is examples, by methods known in the art. 

For sequence information, see e.g. EP 684313, or sequence 
data bases, for example like Genbank (Intelligenetics , 
California, USA) , European Bioinf ormatics Institute (Hinston 
Hall, Cambridge, GB) , NBRF (Georgetown University, Medical 

20 Centre, Washington DC, USA) and Vecbase (University of 
Wisconsin, Biotechnology Centre, Madison, Wisconsin, USA) . 

The process of the present invention can e.g. be used to 
improve the thermostability of the enzyme phytase. 

Once complete DNA sequences of the present invention have 

25 been obtained they can be integrated into vectors by methods 
known in the art and described e.g. in Sambrook et al . (s.a.) to 
overexpress the encoded polypeptide in appropriate host systems. 
However, a man skilled in the art knows that also the DNA 
sequences themselves can be used to transform the suitable host 

3 0 systems of the invention to get overexpression of the encoded 
polypeptide. Appropriate host systems are for example fungi, 
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like Aspergilli, e.g. Aspergillus niger [ATCC 9142] or 
Aspergillus ficuum [NRRL 3135] or like Trichoderma, e.g. 
Trichoderma reesei; or yeasts, like Saccharomyces, e.g. 
Saccharomyces cerevisiae or Pichia, like Pichia pastoris, or 
5 Hansenula polymorpha, e.g. H. polymorpha (DSM5215) ; or plants, 
as described, e.g. by Pen et al . , Bio/Technology 11, 811-814 
(1994) . A man skilled in the art knows that such microorganisms 
are available from depository authorities, e.g. the American 
Type Culture Collection (ATCC) , the Centraalbureau voor 

10 Schimmelcultures (CBS) or the Deutsche Sammlung fur 
Mikroorganismen und Zellkulturen GmbH (DSM) or any other 
depository authority as listed in the Journal "Industrial 
Property" [(1991) 1, pages 29-40]. Bacteria which can be used 
are e.g. E. coli; Bacilli as, e.g., Bacillus subtilis; or 

is Streptomyces, e.g. Streptomyces lividans (see e.g. Anne and 
Mallaert in FEMS Microbiol. Lett. 114, 121 (1993). Preferred E. 
coli strains, which can be used are E. coli K12 strains e.g. M15 
[described as DZ 291 by Villarejo et al . in J. Bacterid . 120, 
466-474 (1974)], HB 101 [ATCC No. 33694] or E. coli SG13009 

20 [Gottesman et al . , J. Bacteriol . 148, 265-273 (1981)]. 

Vectors which can be used for expression in fungi are 
known in the art and described e.g. in EP 420358, or by Cullen 
et al. [Bio/Technology 5, 369-376 (1987)], Ward [Molecular 
Industrial Mycology, Systems and Applications for Filamentous 

25 Fungi, Marcel Dekker, New York (1991)], Upshall et al . 
[Bio/Technology 5, 1301-1304 (1987)], Gwynne et al . 
[Bio/Technology 5, 71-79 (1987)], or Punt et al . [J. Biotechnol . 
17, 19-34 (1991)]; and for yeasts by Sreekrishna et al . [J. 
Basic Microbiol. 28, 265-278 (1988), Biochemistry 28, 4117-4125 

30 (1989)], Hitzemann et al . [Nature 293, 717-722 (1981)] or in EP 
183070, EP 183071, EP 248227, or EP 263311. Suitable vectors 
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which can be used for expression in E. coli are mentioned, e.g. 
by Sambrook et al . [s.a.], Fiers et al . [Procd. 8th Int. 
Biotechnology Symposium", Soc . Franc. de Microbiol., Paris 
(Durand et al . , eds . ) , pp. 680-697 (1988)], Bujard et al . [Meth. 
5 Enzymol. 155, 416-433 (1987)], or Stuber et al . [Immunological 
Methods, eds. Lefkovits and Pernis, Academic Press, Inc., Vol. 
IV, 121-152 (1990)]. Vectors that can be used for expression in 
Bacilli are known in the art and described, e.g. in EP 207459, 
EP 405370, Proc. Natl. Acad. Sci . USA 81, 439 (1984) or Yansura 

10 and Henner, Meth. Enzymol. 185, 199-228 (1990). Vectors which 
can be used for the expression in H. Polymorpha are known in the 
art and described, e.g. in Gellissen et al . , Biotechnology 9, 
291-295 (1991) . 

Either such vectors already carry regulatory elements, 

is e.g. promotors, or the DNA sequences of the present invention 
can be engineered to contain such elements. Suitable promotor 
elements which can be used are known in the art and are, e.g. 
for Trichoderma reesei the cbhl- [Haarki et al . , Biotechnology 
7, 596-600 (1989)] or the pkil -promotor [Schindler et al . , Gene 

20 130, 271-275 (1993)]; for Aspergillus oryzae the amy-promotor 
[Christensen et al . , Abstr. 19th Lunteren Lectures on Molecular 
Genetics F23 (1987), Christensen et al . , Biotechnology 6, 1419- 
1422 (1988), Tada et al . , Mol . Gen. Genet. 229, 301 (1991)]; and 
for Aspergillus niger the glaA- [Cull en et al . , Bio/Technology 

25 5, 369-376 (1987), Gwynne et al . , Bio/Technology 5, 713-719 
(1987), Ward in Molecular Industrial Mycology, Systems and 
Applications for Filamentous Fungi, Marcel Dekker, New York, 83- 
106 (1991)], alcA- [Gwynne et al . , Bio/Technology 5, 718-719 
(1987)], sucl- [Boddy et al . , Curr. Genet. 24, 60-66 (1993)], 

30 aphA- [MacRae et al . , Gene 71, 339-348 (1988), MacRae et al . , 
Gene 132, 193-198 (1993)], tpiA- [McKnight et al . , Cell 46, 143- 
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147 (1986), Upshall et al . , Bio/Technology 5, 1301-1304 (1987)], 
gpdA- [Punt et al., Gene 69, 49-57 (1988), Punt et al . , J. 
Biotechnol. 17, 19-37 (1991)] and the pkiA-promotor [de Graaff 
et al., Curr. Genet. 22, 21-27 (1992)]. Suitable promoter 
5 elements that can be used for expression in yeast are known in 
the art and are, e.g. the pho5 -promoter [Vogel et al . 7 Mol . 
Cell. Biol., 2050-2057 (1989); Rudolf and Hinnen, Proc . Natl. 
Acad. Sci. 84, 1340-1344 (1987)] or the gap-promotor for 
expression in Saccharomyces cerevisiae; the aoxl-promotor [Koutz 
10 et al., Yeast 5, 167-177 (1989); Sreekrishna et al . , J. Basic 
Microbiol. 28, 265-278 (1988)] for Pichia pastoris; or the FMD 
promoter [Hollenberg et al . , EPA No. 0299108] or MOX-promotor 
[Ledeboer et al . , Nucl. Acids Res. 13, 3063-3082 (1985)] for H. 
polymorpha . 

15 Accordingly vectors comprising DNA sequences of the 

present invention, preferably for the expression of said DNA 
sequences in bacteria or a fungal or a yeast host and such 
transformed bacteria or fungal or yeast hosts are also a part of 
the invention . 

20 The invention also provides a system that allows for high 

expression of proteins, in particular of the phytases of the 
invention, such as recombinant Hansenula strains. To achieve 
that, the codons of the DNA sequence of such a protein may be 
selected on the basis of a codon frequency table of the organism 

25 used for expression, e.g. of yeast as in the present case (see 
e.g. in Example 1). Optionally, the codons for the signal 
sequence may be selected in a manner as described for the 
specific case in Example 1; that means that a codon frequency 
table is prepared on the basis of the codons used in the DNA 

30 sequences which encode the amino acid sequences of the given 
protein family. Then the codons for the design of the DNA 
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sequence of the signal sequence are selected from a codon 
frequency table of the host cell used for expression whereby 
always codons of comparable frequency in both tables are used. 

Once such DNA sequences have been expressed in an 
5 appropriate host cell in a suitable medium, the encoded protein 
can be isolated either from the medium in the case the protein 
is secreted into the medium or from the host organism in case 
such protein is present intracellularly by methods known in the 
art of protein purification or described in case of a phytase, 

10 e.g. in EP 420358. Accordingly, a process for the preparation of 
a polypeptide of the present invention wherein transformed 
bacteria or a host cell as described above are cultured under 
suitable culture conditions, and the polypeptide is recovered 
therefrom and a polypeptide when produced by such a process; or 

is a polypeptide encoded by a DNA sequence of the present 
invention, are also a part of the present invention. 

Once obtained, the polypeptides of the present invention 
can be characterized regarding their properties that make them 
useful in agriculture by any assay known in the art. 

20 In general, the polypeptides of the present invention can 

be used without being limited to a specific field of 
application, e.g. in case of phytases for the conversion of 
inositol polyphosphates, like phytate, to inositol and inorganic 
phosphate . 

25 Furthermore, the polypeptides of the present invention can 

be used in a process for the preparation of a pharmaceutical 
composition or compound food or feeds wherein the components of 
such a composition are mixed with at least one polypeptide of 
the present invention. Accordingly, compound food or feeds or 

30 pharmaceutical compositions comprising at least one polypeptide 
of the present invention are also a part of the present 
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invention. A man skilled in the art is familiar with their 
process of preparation. Such pharmaceutical compositions or 
compound foods or feeds can further comprise additives or 
components generally used for such purpose and known in the 
5 state of the art. 

The present invention also provides a process for the 
reduction of levels of phytate in animal manure wherein an 
animal is fed such a feed composition in an amount effective in 
converting phytate contained in the feedstuff to lower inositol 

10 phosphates and/or inositol, and inorganic phosphate. 

In the present context, a phytase is an enzyme or 
polypeptide that has phytase activity. A phytase can be e.g. a 
myo-inositol hexakisphosphate phosphohydrolase, such as (myo- 
inositol hexakisphosphate 3 -phosphohydrolase , EC 3.1.3.8) and 

is (myo-inositol hexakisphosphate 6 -phosphohydrolase, EC 3.1.3.26). 

In one embodiment, the phytase is purified, viz. at least 
85%, preferably at least 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 
96, 97, 98, 99% pure, as evaluated by SDS-PAGE. The phytase may 
be isolated. Phytase activity can be determined using any 

20 phytase assay known in the art, e.g. the assay described herein 
(see Example 9) . The assay temperature may be the optimum 
temperature of the actual phytase, and the assay pH may be the 
optimum pH of the actual phytase. 

The assay temperature may e.g. be selected within the 

25 range of 20-90°C, or 30-80°C, or 35-75°C, for instance 
temperatures of 37°C, 50°C, 60°C, or 70°C. 

The assay pH may e.g. be selected within the range of pH 
2-9, or 3-8, or 3-6, for instance assay pH values of 3, 4, 5, 6, 
or 7 may be chosen. 

30 Amino acid sequence homology (or polypeptide or amino acid 

homology) is determined as the degree of identity between two 
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sequences. This may suitably be determined by means of computer 
programs known in the art such as GAP provided in the GCG 
program package [Program Manual for the Wisconsin Package, 
Genetics Computer Group, 575 Science Drive, Madison, Wisconsin 
5 53711, USA], see also Needleman, S.B. and Wunsch, CD., (1970), 
J. Mol. Biol., 48, 443-453]. In release 9.1, for comparing 
polypeptide sequences, the Length Weight is set to 0, and the 
Gap Weight is set to 3.0. 

The degree of identity or homology between two DNA 

10 (nucleic acid) sequences may be determined by means of computer 
programs known in the art such as GAP provided in the GCG 
program package [Program Manual for the Wisconsin Package, 
Genetics Computer Group, 575 Science Drive, Madison, Wisconsin 
53711, USA), see also Needleman, S.B. and Wunsch, CD., (1970), 

15 J. Mol. Biol., 48, 443-453]. In release 9.1, GAP is used with 
the following settings for DNA sequence comparison: GAP creation 
penalty of 50 and GAP extension penalty of 3. 

Suitable experimental conditions for determining whether a 
given DNA or RNA sequence hybridizes to a specified nucleotide 

20 or oligonucleotide probe involves presoaking of the filter 
containing the DNA or RNA fragments to examine for hybridization 
in 5 x SSC (Sodium chloride/Sodium citrate; (J. Sambrook, E.F. 
Fritsch, and T. Maniatis, 1989, Molecular Cloning, A Laboratory 
Manual, 2nd edition, Cold Spring Harbor, New York) for 10 min, 

25 and prehybridization of the filter in a solution of 5 x SSC, 5 x 
Denhardt's solution, 0.5 % SDS and 10 0 fig /ml of denatured 
sonicated salmon sperm DNA (Sambrook et al . 1989), followed by 
hybridization in the same solution containing a concentration of 
10 ng/ml of a random-primed (Feinberg, A. P. and Vogel stein, B. 

30 (1983) Anal. Biochem. 132:6-13), 32 P-dCTP-labeled (specific 
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activity > 1 x 10 9 cpm/ptg) probe for 12 hours at approximately 
45°C. 

The filter is then washed twice for 30 minutes in 2 x SSC, 
0.5 % SDS at at least 55°C (low stringency), at at least 60°C 
5 (medium stringency) , at at least 65 °C (medium/high stringency) , 
at at least 70°C (high stringency) , or at at least 75°C (very 
high stringency) . 

Molecules to which the oligonucleotide probe hybridizes 
under these conditions can be detected using an x-ray film. 
io Phytases of amended thermostability, or thermostable 

phytases, are one aspect of the present invention. A 
xx thermostable ' ' phytase is a phytase that has a Tm (melting 
temperature) - as measured on purified phytase protein by 
Differential Scanning Calorimetry (DSC) - of at least 65 °C. For 
is the DSC, a constant heating rate may be used, e.g. of 10°C/min. 
In alternative embodiments, the Tm is at least 66, 67, 68, 69, 
70, 71, 72, 73, 74 or 75°C. Or, the Tm is equal to or lower than 
150°C, or equal to or lower than 145, 140, 135, 130, 125, 120, 
115 or 110 °C. Accordingly, examples of intervals of Tm are: 65- 
20 150°C, 66-150°C, - (etc.) - 75-150°C; 65-145°C, 66-145°C, - 
(etc.) - 75-145°C; 65-140°C, - (etc.) - 75-140°C; - (etc.) - 65- 
110°C, 66-110°C, - (etc.) - 75-110°C. 

Particular ranges for Tm are the following: between 65 and 
110 °C; between 7 0 and 110 °C; between 7 0 and 100 °C; between 75 
25 and 95°C, or between 80 and 90°C. 

In Examples 9 and 10 below, the measurement of Tm by DSC 
is described, and the Tm 7 s of a number of phytases are shown. 

The optimum temperatures are also indicated, since - as an 
alternative mean - a thermostable phytase can be defined as a 
30 phytase having a temperature-optimum of at least 60 °C. 
Preferably, the optimum temperature is determined on the 
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substrate phytate or phytic acid at pH 5.0 or 5.5. Example 9 
describes an example of a phytase assay, including a definition 
of units. 

In alternative embodiments, the optimum temperature is at 
5 least 61, 62, 63, 64, 65, 66, 67, 68, 69 or 70°C. In a 
particular embodiment, the optimum temperature is equal to or 
lower than 14 0°C, or equal to or lower than 13 5, 13 0, 12 5, 12 0, 
115, 110, 105 or 100 °C. Accordingly, examples of intervals of 
optimum temperature are: 60-140°C, 61-140°C, - (etc.) - 70- 
10 140°C; 60-135°C, 61-135°C, - (etc.) - 70-135°C; 60-130°C, - 
(etc.) - 70-130°C; - (etc.) - 60-100°C, 61-100°C, - (etc.) - 70- 
100°C. 

Before describing the present invention in more detail a 
short explanation of the Figures enclosed is given below. 

15 

Figure 1 : Design of the consensus phytase- 1 sequence. The 
letters represent the amino acid residues in the one-letter 
code. The following sequences were used for the alignment: phyA 
from Aspergillus terreus 9A-1 [Mitchell, D. B., Vogel, K. , 

20 Weimann, B. J., Pasamontes, L. & van Loon, A. P. G. M. (1997) 
The phytase subfamily of histidine acid phosphatases: isolation 
of genes for two novel phytases from the fungi Aspergillus 
terreus and Myceliophthora thermophila, Microbiology 143, 245- 
252); from amino acid (aa) 27; SEQ ID NO: 1]; phyA from A. 

25 terreus cbsll6.46 [EP 897985]. A heat resistant phytase of 
Aspergillus fumigatus with superior performance in animal 
experiments. Phytase optimization and natural variability. In: 
The Biochemistry of phytate and phytases (eds. Rasmussen, S.K; 
Raboy, V.; Dalb0ge, H. and Loewus, F.; Kluwer Academic 

30 Publishers); from aa 27; SEQ ID NO: 2; phyA from Aspergillus 
niger var. awamori (Piddington et al (1993) Gene 133, 55-62; 
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from aa 27; SEQ ID NO: 3); phyA from A. niger T213 (EP 897985); 
from aa 27; SEQ ID NO: 4); phyA from A. niger strain NRRL3135 
[van Hartingsveldt , W. , van Zeijl, C. M. F. # Harteveld, G. M. , 
Gouka, R. J., Suykerbuyk, M . E. G. , Luiten, R. G. M. , van 
5 Paridon, P. A. , Selten, G. C. M . , Veenstra, A. E., van Gorcom, 
R. F. M., & van den Hondel, C. A. M. J. J. (1993) Cloning, 
characterization and overexpression of the phytase-encoding gene 
(phyA) of Aspergillus niger. Gene 127, 87-94; from aa 27; SEQ ID 
NO: 5] ; phyA from Aspergillus fumigatus ATCC 13 0 73 (Pasamontes, 

10 L., Haiker, M. , Wyss, M. , Tessier, M. & van Loon, A. P. G. M. 
(1997) Cloning, purification and characterization of a heat 
stable phytase from the fungus Aspergillus fumigatus, Appl . 
Environ. Microbiol. 63, 1696-1700; from aa 25; SEQ ID NO: 6] ; 
phyA from A. fumigatus ATCC 32722 (EP 897985); from aa 27; SEQ 

15 ID NO: 7) ; phyA from A. fumigatus ATCC 58128 (EP 897985) ; from 
aa 27; SEQ ID NO: 8) ; phyA from A. fumigatus ATCC 2 6906 
(EP 897985); from aa 27; SEQ ID NO: 9); phyA from A. fumigatus 
ATCC 32239 (EP 897985); from aa 30; SEQ ID NO: 10; phyA from 
Emericella nidulans [Pasamontes, L., Haiker, M., Henriquez- 

20 Huecas, M. , Mitchell, D. B. & van Loon, A. P. G. M. (1997a). 
Cloning of the phytases from Emericella nidulans and the 
thermophilic fungus Talaromyces thermophilus . Biochim. Biophys . 
Acta 1353, 217-223; from aa 25; SEQ ID NO: 11]; phyA from 
Talaromyces thermophilus (Pasamontes et al . , 1997a; from aa 24; 

25 SEQ ID NO: 12) ; and phyA from Myceliophthora thermophila 
(Mitchell et al . , 1997; from aa 19; SEQ ID NO: 13). The 
alignment was calculated using the program PILEUP. The location 
of the gaps was refined by hand. Capitalized amino acid residues 
in the alignment at a given position belong to the amino acid 

30 coalition that establish the consensus residue. In bold, beneath 
the calculated consensus sequence, the amino acid sequence of 
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the finally constructed consensus phytase (Fcp) is shown (SEQ ID 
NO: 14) . The gaps in the calculated consensus sequence were 
filled by hand according to principals stated in Example 1. 

5 Figure 2 : DNA sequence (SEQ ID NO: 15) of the consensus 

phytase -1 gene (fcp) and of the primers used for the gene con- 
struction. The calculated amino acid sequence (Figure 1, SEQ ID 
NO: 14) was converted into a DNA sequence using the program 
BACKTRANSLATE [Devereux, J., Haeberli, P. & Smithies, 0. (1984) 

io A comprehensive set of sequence analysis programs for the VAX. 
Nucl. Acids Res. 12, 387-395], and the codon frequency table of 
highly expressed yeast genes (GCG program package, 9.0). The 
signal peptide of the phytase from A. terreus cbs 116.46 was 
fused to the N- terminus. The amino acid sequence shown in Fig. 2 

15 is SEQ ID NO: 16. The bold bases represent the sequences of the 
oligonucleotides used to generate the gene. The names of the re- 
spective oligonucleotides are alternately noted above or below 
the sequence. The underlined bases represent the start and stop 
codon of the gene. The bases written in italics represent the 

20 two introduced Eco RI sites. 

Figure 3 : Alignment and consensus sequence of five 
Basidiomycete phytases. The letters represent the amino acid 
residues in the one -letter code. The amino acid sequences of the 

25 phytases from Paxillus involutus, phyAl (from aa 21; SEQ ID NO: 
17; and phyA2 (from aa 21, WO 98/28409; SEQ ID NO: 18); Trametes 
pubescens (from aa 24, WO 98/28409; SEQ ID NO: 19); Agrocybe 
pediades (from aa 19, WO 98/28409; SEQ ID NO: 20); and 
Peniophora lycii (from aa 21, WO 98/28409; SEQ ID NO: 21), 

30 starting with the amino acid residues mentioned in parentheses, 
were used for the alignment and the calculation of the 
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corresponding consensus sequence called "Basidio" (Example 2; 
SEQ ID NO: 22) . The alignment was performed with the program 
PILEPUP. The location of the gaps was refined by hand. The 
consensus sequence was calculated by the program PRETTY. While a 
5 vote weight of 0.5 was assigned to the two P. involutus 
phytases, all other genes were used with a vote weight of 1.0 
for the consensus sequence calculation. At positions where the 
program was not able to determine a consensus residue, the 
Basidio sequence contains a dash. Capitalized amino acid 
10 residues in the alignment at a given position represent the 
amino acid coalition that established the consensus residue. 

Figure 4 : Design of consensus phytase-10 amino acid 
sequence. By adding the sequence of Thermomyces lanuginosus 

15 phytase [Berka, R. M., Rey, M. W., Brown, K. M . , Byun, T. & 
Klotz, A. V. (1998) Molecular characterization and expression of 
a phytase gene from the thermophilic fungus Thermomyces 
lanuginosus. Appl . Environ. Microbiol. 64, 4423-4427; SEQ ID NO: 
23] and the consensus sequence of the phytases from five 

20 Basidiomycetes (SEQ ID NO: 22) to the alignment of Figure 1, an 
improved consensus sequence was calculated by the program 
PRETTY. Additionally, the amino acid sequence of A. niger T213 
was omitted, and a vote weight of 0.5 was assigned to the 
remaining two A. niger phytase sequences. For further 

25 information see Example 2 . 

Figure 5 : DNA and amino acid sequence of consensus 
phytase-10 (SEQ ID NO: 25, and SEQ ID NO: 26, respectively) . The 
amino acid sequence is written above the corresponding DNA 
30 sequence using the one-letter code. The sequence of the 
oligonucleotides that were used to assemble the gene are in 
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bold letters. The names of the respective oligonucleotides and 
the amino acids that differ relative to consensus phytase-1 are 
underlined. The fcplO gene was assembled from the following 
oligonucleotides: CP-1, CP-2, CP-3.10, CP-4.10, CP-5.10, CP-6 7 
5 CP-7.10, CP-8.10, CP-9.10, CP-10.10, CP-11.10, CP-12.10, CP- 
13.10, CP-14.10, CP-15.10, CP-16.10, CP-17.10, CP18.10, CP- 
19.10, CP-20.10, CP-21.10, and CP-22.10. The newly synthesized 
oligonucleotides are additionally marked by the number 10. The 
phytase contains the following 32 exchanges relative to 

10 consensus phytase-1: Y54F, E58A, D69K 7 D70G, A94K, N134Q, I158V, 
S187A, Q188N, D197N , S204A, T214L, D220E, L234V 7 A238P r D246H, 
T251N, Y259N, E267D, E277Q, A283D, R291I , A320V, R329H 7 S364T, 
I366V, A379K, S396A 7 G404A 7 Q415E; A437G 7 A463E. The underlined 
mutations revealed a stabilizing effect on consensus phytase-1 

15 when tested as single mutations in consensus phytase-1. 

Figure 6 : Alignment for the design of consensus phytase- 11 
(SEQ ID NO: 27) . In contrast to the design of consensus phytase- 
10, for the design of the amino acid sequence of consensus 
20 phytase -11, all Basidiomycete phytases were used as independent 
sequences using an assigned vote weight of 0.2 for each 
Basidiomycete sequence. Additionally, the amino acid sequence of 
A. niger T213 was again used in this alignment. 

25 Figure 7 : DNA and amino acid sequence of consensus 

phytase- 1-thermo [8] -Q50T-K91A (SEQ ID NO: 28, and SEQ ID NO: 29, 
respectively) . The amino acid sequence is written above the 
corresponding DNA sequence using the one-letter code. The 
replaced amino acid residues (relative to consensus phytase-1) 

30 are underlined. The stop codon of the gene is marked by a star 
(*) • 
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Figure 8 : DNA and amino acid sequence of consensus 
phytase-10-thermo [3] -Q50T-K91A (SEQ ID NO: 30, and SEQ ID NO: 
31, respectively) . The amino acid sequence is written above the 
5 corresponding DNA sequence using the one-letter code. The 
replaced amino acid residues (relative to consensus phytase-10) 
are underlined. The stop codon of the gene is marked by a star 
(*) . 

10 Figure 9 : DNA and amino acid sequence of A. fumigatus ATCC 

13073 phytase alpha-mutant Q51T (SEQ ID NO: 32, and SEQ ID NO: 
33, respectively). The amino acid sequence is written above the 
corresponding DNA sequence using the one-letter code. The 
replaced amino acid residues (relative to A. fumigatus ATCC 

15 13073 phytase) are underlined. The stop codon of the gene is 
marked by a star (*) . 

Figure 10 : DNA and amino acid sequence of consensus 
phytase-7 (SEQ ID NO: 34, and SEQ ID NO: 35, respectively) . The 

20 amino acids are written above the corresponding DNA sequence 
using the one- letter code. The sequence of the oligonucleotides 
used to assemble the gene are in bold letters. Oligonucleotides 
and amino acids that were exchanged (relative to consensus 
phytase- 1) are underlined and the corresponding triplets are 

25 written in small case letters. The fcp7 gene was assembled from 
the following oligonucleotides: CP-1, CP-2, CP-3, CP-4.7, CP- 
5.7, CP-6, CP-7, CP-8.7, CP-9, CP-10.7, CP-11.7, CP-12.7, CP- 
13.7, CP-14.7, CP-15.7, CP-16, CP-17.7, CP-18.7, CP-19.7, CP-20, 
CP-21, and CP-22 . The newly synthesized oligonucleotides are 

30 additionally marked by the number 7. Consensus phytase-7 
contains the following 24 exchanges in comparison to the 
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original consensus phytase-1: S89D, S92G, A94K, D164S, P201S, 
G203A, G205S, H212P, G224A, D226T, E255T, D256E, V258T, P265S, 
Q292H, G300K, Y305H, A314T, S364G, M365I, A397S, S398A, G404A, 
and A4 05S. 

5 

Figure 11 : Differential scanning calorimetry (DSC) of 
consensus phytase-1 and consensus phytase-10. The protein 
samples were concentrated to about 50-60 mg/ml and extensively 
dialyzed against 10 mM sodium acetate, pH 5.0. A constant 
io heating rate of 10°C/min was applied up to 95°C. DSC of 
consensus phytase-10 (upper graph) yielded a melting temperature 
of 85.4°C, which is 7.3°C higher than the melting point of 
consensus phytase-1 (78.1°C, lower graph) . 

is Figure 12 : Differential scanning calorimetry (DSC) of 

consensus phytase-10 -thermo [3] -Q50T and consensus phytase-10- 
thermo [3] -Q50T-K91A. The protein samples were concentrated to 
ca. 50-60 mg/ml and extensively dialyzed against 10 mM sodium 
acetate, pH 5.0. A constant heating rate of 10°C/min was applied 

20 up to 95°C. DSC of consensus phytase-10 -thermo [3] -Q50T (upper 
graph) yielded a melting temperature of 88.6°C, while the 
melting temperature of consensus phytase-10-thermo-Q50T-K91A was 
determined to be 89.3°C. 

25 Figure 13 : Comparison of the temperature optimum between 

consensus phytase-1, consensus phytase-10 and consensus phytase- 
10-thermo [3] -Q50T. For the determination of the temperature 
optimum, the phytase standard assay of Example 9 was performed 
at a series of temperatures between 3 7 and 86 °C. The diluted 

30 supernatant of transformed S. cerevisiae strains was used for 
the determination. The other components of the supernatant had 
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no influence on the determination of the temperature optimum: a, 

consensus phytase-1; consensus phytase-10; ■ , consensus 

phytase 10-thermo [3] -Q50T. 

5 Figure 14 : pH-dependent activity profile and substrate 

specificity of consensus phytase-10 and its variants thermo [3]- 
Q50T and thermo [3] -Q50T-K91A. The phytase activity was 
determined using the standard assay in appropriate buffers (see 
Example 9) at different pH-values. Graph a) shows the pH- 
10 dependent activity profile of consensus phytase-10 (□) , 

consensus phytase- 10 -thermo [3] -Q50T (•), and consensus phytase- 
10-thermo [3] -Q50T-K91A (a). Graph b) shows the corresponding 
substrate specificity tested by replacement of phytate in the 
standard assay by the indicated compounds; open bars, consensus 

15 phytase-10; grey bars, consensus phytase- 10 -thermo [3] -Q50T; dark 
bars, consensus phytase- 10 -thermo [3] -Q50T-K91A) . The numbers 
correspond to the following substrates: 1, phytate; 2, p- 
nitrophenyl phosphate; 3, phenyl phosphate; 4, f ructose-1 , 6- 
bisphosphate; 5, fructose -6 -phosphate; 6, glucose -6 -phosphate; 

20 7, ribose- 5 -phosphate; 8, DL-glycerol- 3 -phosphate; 9, glycerol - 
2 -phosphate; 10, 3-phosphoglycerate; 11, phosphoenolpyruvate ; 
12, AMP; 13, ADP; 14, ATP. 

Figure 15 : pH-dependent activity profile and substrate 
25 specificity of consensus phytase-1 -thermo [8] -Q50T and of 
consensus phytase-1 -thermo [8] -Q50T-K91A. The phytase activity 
was determined using the standard assay in appropriate buffers 
(see Example 9) at different pH- values. Graph a) shows the pH- 

dependent activity profile of the Q50T- (■) and the Q50T-K91A- 
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variant ( . ) . Graph b) shows the corresponding substrate 
specificities tested by replacement of phytate in the standard 
assay by the indicated compounds (open bars, consensus phytase - 
1-thermo [8] -Q50T; filled bars 7 consensus phytase-l-thermo [8] - 
5 Q50T-K91A) . The substrates are listed in the legend of Figure 
14 . 

Figure 16 : Differential scanning calorimetry (DSC) of 
consensus phytase-l-thermo [8] -Q50T and consensus phytase-1- 

10 thermo [8] -Q50T-K91A. The protein samples were concentrated to 
ca. 50-60 mg/ml and extensively dialyzed against 10 mM sodium 
acetate, pH 5.0. A constant heating rate of 10°C/min was applied 
up to 95°C. DSC of consensus phytase-l-thermo [8] -Q50T (upper 
graph) showed a melting temperature of 84.7°C, while the melting 

15 point of consensus phytase-l-thermo [8] -Q50T-K91A was found at 
85 . 7°C. 

Figure 17 : Comparison of the temperature optimum between 
consensus phytase-1, consensus phytase-l-thermo [3] and consensus 
20 phytase-l-thermo [8] . For the determination of the temperature 
optimum, the phytase standard assay was performed at a series of 
temperatures between 37 and 86°C. Protein purified from the 
supernatant of transformed S. cerevisiae strains was used for 
the determination. O, consensus phytase- 1; □, consensus 

25 phytase-l-thermo [3] ; ^, consensus phytase l-thermo[8]. 

Figure 18 : Comparison of the pH-dependent activity profile 
and substrate specificity between consensus phytase- 1, consensus 
phytase-7, and the phytase from A. niger NRRL 3135. The phytase 
30 activity was determined using the standard assay in appropriate 
buffers (see Example 9) at different pH-values . Graph a) shows 
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the pH- dependent activity profile of consensus phytase-1 (■) , 
the phytase from A. niger NRRL 313 5 (0) , and of consensus 

phytase-7 (^). Graph b) shows the corresponding substrate 
specificities tested by replacement of phytate in the standard 
5 assay by the indicated compounds (black bars, A. niger NRRL 313 5 
phytase; open bars, consensus phytase-1; dashed bars, consensus 
phytase-7) . The substrates are listed in the legend of Figure 
14. 

10 Figure 19 : Differential scanning calorimetry (DSC) of the 

phytase from A. fumigatus ATCC 13073 and of its stabilized 
alpha-mutant, which contains the following amino acid exchanges: 
F55Y, V100I, F114Y, A243L, S265P, and N294D. 

The protein samples were concentrated to ca. 50-60 mg/ml 

is and extensively dialyzed against 10 mM sodium acetate, pH 5.0. A 
constant heating rate of 10°C/min was applied up to 95°C. DSC of 
A. fumigatus 13073 phytase (lower graph) revealed a melting 
temperature of 62.5°C, while the melting point of the alpha- 
mutant was found at 67.0°C. 

20 

Figure 20 : Comparison of the temperature optima of A. 
fumigatus 13073 wild-type phytase, its alpha-mutant, and a 
further stabilized alpha-mutant (E59A-S154N-R329H-S364T-G404A) . 
For the determination of the temperature optimum, the phytase 
25 standard assay was performed at a series of temperatures between 
37 and 75°C. The diluted supernatant of transformed S. 
cerevisiae strains was used for the determination. The other 
components of the supernatant had no influence on the 
determination of the temperature optimum. O, A. fumigatus ATCC 

30 13073 phytase; ^, A. fumigatus ATCC 13073 alpha-mutant; □, A. 
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fumigatus ATCC 13073 alpha -mutant- (E59A-S154N-R329H-S364T- 

G404A) -Q27T; ■ , A. fumigatus ATCC 13073 alpha-mutant- (E59A- 
S154N-R329H-S364T-G404A) -Q51T-K92A. Q51T and K92A correspond to 
consensus phytase-1 substitutions Q50T and K91A, respectively. 

5 

Figure 21 : Amino acid sequence of consensus phytase-12 
(consphyl2; SEQ ID NO: 36) which contains a number of active 
site residues transferred from the "basidio" consensus sequence 
to consensus phytase-10-thermo [3] -Q50T-K91A (underlined). 

10 

Figure 22 : DNA and amino acid sequence of consensus 
phytase-3-thermo [11] -Q5 0T. The amino acids are written below the 
corresponding DNA sequence using the one-letter code. 

is Figure 23 : DNA and amino acid sequence of consensus 

phytase-3-thermo [11] -Q50T-K91A. The amino acids are written 
below the corresponding DNA sequence using the one-letter code. 

Figure 24 : DNA and amino acid sequence of consensus 
20 phytase-10-thermo [5] -Q5 0T. The amino acids are written below the 
corresponding DNA sequence using the one-letter code. 

Figure 25 : DNA and amino acid sequence of consensus 
phytase-10-thermo [5] -Q50T-K91A. The amino acids are written 
25 below the corresponding DNA sequence using the one -letter code. 

The phytase -producing microorganism strains mentioned 
herein, viz. Paxillus involutus CBS 100231; Peniophora lycii CBS 
686.96; Agrocybe pediades CBS 900.96; and Trametes pubescens CBS 
30 100232; were isolated from natural samples originating from, 
respectively, Denmark; Denmark; Denmark; and Sweden (the Uppsala 
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collection. The samples were collected in November 1992; October 
1993; June 1995; and in November 1995, respectively. 

Example 1 
5 Consensus phytase-1 

The amino acid sequence of consensus phytase-1 (fungal 
consensus phytase, fcp) was designed and calculated as described 
in Examples 1 and 2 of EP 897985. Table 1 below shows the origin 
and vote weight of the phytase amino acid sequences used for the 
10 design of consensus phytase-1. The consensus phytase-1 sequence 
was furthermore converted into a DNA sequence as described in 
Example 3 of EP 897985, and the consensus phytase-1 gene was 
constructed and cloned as described in Example 4 of EP 897985. 

15 Table 1 

Origin and vote weight of the phytase amino acid sequences 

- phyA from Aspergillus terreus 9A-1, aa 27, vote weight 0.5 
(Mitchell et al . , 1997) 

- phyA from Aspergillus terreus cbs!16 .46 , aa 27, vote weight 
20 0.5 (EP 897985) 

- phyA from Aspergillus niger var. awamori, aa 27, vote weight 
0.33 [Piddington, C. S., Houston, C. S., Paloheimo, M . , 
Cantrell, M . , Miettinen-Oinonen, A., Nevalainen, H. , & Rambosek, 
J. (1993) The cloning and sequencing of the genes encoding 

25 phytase (phy) and pH 2.5-optimum acid phosphatase (aph) from 
Aspergillus niger var. awamori. Gene 133, 55-62]. 

- phyA from Aspergillus niger T213 (EP 897985), aa 27, vote 
weight 0.33 

30 - phyA from Aspergillus niger strain NRRL3135, aa 27, vote 
weight 0.33 (van Hart ingsveldt et al . , 1993) 
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- phyA from Aspergillus fumigatus ATCC 13073, aa 26, vote 
weight 0.2 (Pasamontes et al . , 1997) 

- phyA from Aspergillus fumigatus ATCC 32722, aa 26, vote 
weight 0.2 (EP 897985) 

5 - phyA from Aspergillus fumigatus ATCC 58128, aa 26, vote weight 
0.2 (EP 897985) 

- phyA from Aspergillus fumigatus ATCC 26906, aa 26, vote 
weight 0.2 (EP 897985) 

- phyA from Aspergillus fumigatus ATCC 32239, aa 30, vote weight 
10 0.2 (EP 897985) 

- phyA from Emericella nidulans , aa 25, vote weight 1.0 
(Pasamontes et al . , 1997a) 

- phyA from Talaromyces thermophilus ATCC 20186, aa 24, vote 
weight 1.0 (Pasamontes et al . , 1997a) 

15 - phyA from Myceliophthora thermophila, aa 19, vote weight 1.0 
(Mitchell et al . , 1997) 

Example 2 

Design of an improved consensus phytase (consensus 
2 0 phytase- 10) amino acid sequence 

The alignments used for the design of consensus phytase-10 
were calculated using the program PILEUP from the GCG Sequence 
Analysis Package Release 9.0 (Devereux et al . , 1984) with the 
25 standard parameters (gap creation penalty 12, gap extension 
penalty 4) . The location of the gaps was refined using a text 
editor . 

The following sequences were used for the alignment of the 
Basiodiomycete phytases starting with the amino acid (aa) 
30 mentioned in Table 2: 
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Table 2 

Origin and vote weight of five Basidiomycete phytases used for 
the calculation of the corresponding consensus amino acid 
sequence (basidio) 

5 

- phyAl from Paxillus involutus CBS No. 100231, aa 21, vote 
weight 0.5 (WO 98/28409) 

- phyA2 from Paxillus involutus CBS No. 100231, aa 21, vote 
weight 0.5 (WO 98/28409) 

10 - phyA from Trametes pubescens CBS No. 100232, aa 24, vote 
weight 1.0 (WO 98/28409) 

- phyA from Agrocybe pediades CBS No. 900.96, aa 19, vote weight 
1.0 (WO 98/28409) 

- phyA from Peniophora lycii CBS No. 686.96, aa 21, vote weight 
15 1.0 (WO 98/28409) 

The alignment is shown in Figure 3 . 

In Table 3 the genes that were used for the final 
20 alignment are listed. The first amino acid (aa) of the sequence 
that is used in the alignment is mentioned behind the organism's 
designation . 

Table 3 

2 5 Origin and vote weight of the phytase sequences used for the 
design of consensus phytase-10 

- phyA from Aspergillus terreus 9A-1, aa 27, vote weight 0.5 
(Mitchell et al . , 1997) 

30 - phyA from Aspergillus terreus cbsll6.46, aa 27, vote weight 
0.5 (EP 897985) 
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- phyA from Aspergillus niger var. awamori, aa 27, vote weight 
0.5 (Piddington et al . , 1993) 

- phyA from Aspergillus niger strain NRRL313 5, aa 27, vote 
weight 0.5 (van Hartingsveldt et al . , 1993) 

5 - phyA from Aspergillus fumigatus ATCC 13073, aa 26, vote 
weight 0.2 (Pasamontes et al . , 1997) 

- phyA from Aspergillus fumigatus ATCC 32722, aa 26, vote 
weight 0.2 (EP 897985) 

- phyA from Aspergillus fumigatus ATCC 58128, aa 26, vote weight 
10 0.2 (EP 897985) 

- phyA from Aspergillus fumigatus ATCC 26906, aa 26, vote weight 
0.2 (EP 897985) 

- phyA from Aspergillus fumigatus ATCC 3223 9, aa 30, vote weight 
0.2 (EP 897985) 

15 - phyA from Emericella nidulans , aa 25, vote weight 1.0 
(Pasamontes et al . , 1997a) 

- phyA from Talaromyces thermophilus ATCC 20186, aa 24, vote 
weight 1.0 (Pasamontes et al . , 1997a) 

- phyA from Myceliophthora thermophila, aa 19, vote weight 1.0 
20 (Mitchell et al . , 1997) 

- phyA from Thermomyces lanuginosus, aa 36, vote weight 1.0 
(Berka et al . , 1998) 

- Consensus sequence of five Basidiomycete phytases, vote weight 
1.0 (Basidio, Figure 3) 

25 

The corresponding alignment is shown in Figure 4. 

Calculation of the amino acid sequence of consensus 
phytase-10 

3 0 To improve the alignment, we added the original consensus 

sequence of five phytases from four different Basidiomycetes 
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(called Basidio; still containing the undefined sequence 
positions; see Figure 3) , nearly all phytase sequences used for 
the calculation of the original consensus phytase sequences and 
one new phytase sequence from the Ascomycete Thermomyces 
5 lanuginosus to a larger alignment. 

We set plurality on 2.0 and threshold on 3. The used vote 
weights are listed in Table 3. The alignment and the 
corresponding consensus sequence are presented in Figure 4 . The 
new consensus phytase sequence has 32 different amino acids in 
io comparison to the original consensus phytase-1. Positions for 
which the program PRETTY was not able to calculate a consensus 
amino acid residue were filled according to rules mentioned in 
Example 1. None of the residues suggested by the program was 
replaced. 

15 Furthermore, in another calculation, we included all 

Basidiomycete phytases as single amino acid sequences but 
assigning a vote weight of 0.2 in the calculation. The 
corresponding alignment is shown in Figure 6. The calculated 
consensus amino acid sequence (consensus phytase- 11) has the 

20 following differences to the sequence of consensus phytase-10. 
Letter X means that the program was not able to calculate a 
consensus amino acid; the amino acid in parenthesis corresponds 
to the amino acid finally included into consensus phytase-10. 

D35X (first letter for consensus phytase-10, last letter 

25 for consensus phytase- 11 ) , X(K)69K, X(E)100E / A101R, Q134N, 
X(K)153N, X(H)190H, X(A)204S, X(E)220D / E222T, V227A, X(R)271R, 
H287A, X(D)288D / X(K) 379K, X(I)389I, E390X, X(E)415E, X(A)416A, 
X(R)446L, E463A. The numbering is as in Fig. 5. 

We also checked single amino acid replacements suggested 

3 0 by the improved consensus sequences 10 and 11 on their influence 
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on the stability of the original consensus phytase-1. The 
approach is described in example 3 . 

Conversion of the consensus phytase-10 amino acid sequence 
5 into a DNA sequence 

The first 26 amino acid residues of A. terreus cbsll6.46 
phytase were used as signal peptide and fused to the N-terminus 
of consensus phytase-10. The used procedure is further described 
in Example 1 . 

10 The resulting sequence of the fcplO gene is shown in 

Figure 5 . 

Construction and cloning of the consensus phytase-10 gene 
(fcplO) 

15 The calculated DNA sequence of fcplO was divided into 

oligonucleotides of 85 bp, alternately using the sequence of the 
sense and the anti -sense strand. Every oligonucleotide overlaps 
2 0 bp with the previous and the following oligonucleotide of the 
opposite strand. The location of all primers, purchased from 

20 Microsynth, Balgach (Switzerland) and obtained in a PAGE- 
purified form, is indicated in Figure 5. 

PCR-Reactions 

In three PCR reactions, the synthesized oligonucleotides 
25 were composed to the entire gene. For the PCR, the High Fidelity 
Kit from Boehringer Mannheim (Boehringer Mannheim, Mannheim, 
Germany) and the thermo cycler v ^The ProtokolTM" from AMS 
Biotechnology (Europe) Ltd. (Lugano, Switzerland) were used. The 
following oligonucleotides were used in a concentration of 0.2 
3 0 pMol/ml. 
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Mix 1.10: CP-1, CP-2, CP-3.10, CP-4.10, CP-5.10, CP-6 # 
CP-7.10, CP-8.10, CP-9.10, CP-10.10 

Mix 2.10: CP-9.10, CP-11.10, CP-12.10, CP-13.10, CP- 
5 14.10, CP-15.10, CP-16.10, CP-17.10, CP18.10, 

CP-19.10, CP-20.10, CP-21.10, CP-22.10 
The newly synthesized oligonucleotides are marked by the 
number 10. Consensus phytase-10 contains the following 32 
exchanges, which are underlined in Figure 5, in comparison to 
10 the original consensus phytase-1: Y54F, E58A, D69K, D70G, A94K, 
N134Q, I158V, S187A, Q188N, D197N, S204A, T214L, D220E, L234V, 
A238P, D246H, T251N, Y259N, E267D, E277Q, A283D, R291I, A320V, 
R329H, S364T, I366V, A379K, S396A, G404A, Q415E, A437G, A463E. 

Four short PCR primers were used for the assembling of the 
15 oligonucleotides : 

CP-a: Eco RI 

5 ' - TAT AT GAATTCATGGGCGTGTTCGTC - 3 ' ( SEQ ID NO : 3 7) 



20 CP-b: 



5 ' -TGAAAAGTTCATTGAAGGTTTC-3 ' (SEQ ID NO : 3 8) 



CP-C. 10 : 



5 7 - TCTTCGAAAGCAGTACACAAAC - 3 ' ( SEQ ID NO : 3 9) 



25 

CP-e: Eco RI 



5 7 -TATATGAATTCTTAAGCGAAAC-3 7 (SEQ ID NO : 40) 
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10 [xl Mix 1.10 (2.0 pmol of each 
oligonucleotide) 

2 fil nucleotides (10 mM of each nucleotide) 
2 fil primer CP- a (10 pmol /ml) 
2 fil primer CP-c.10 (10 pmol /ml) 
10,0 ill PCR buffer 

0.75 ill polymerase mixture (2.6 U) 
73.25 fil H 2 0 

10 fil Mix 2.10 (2.0 pmol of each 

oligonucleotide) 

2 pil nucleotides (10 mM each nucleotide) 
2 ill primer CP-b (10 pmol /ml) 
2 jil primer CP-e (10 pmol /ml) 
10, 0 fil PCR buffer 

0.75 ill polymerase mixture (2.6 U) 
73 .25 jil H 2 0 

Reaction conditions for PCR reactions a and b: 



step 


1 


2 min - 


45°C 


step 


2 


30 sec 


- 72°C 


step 


3 


30 sec 


- 94°C 


step 


4 


30 sec 


■ 52°C 


step 


5 


1 min - 


72°C 



25 

Steps 3 to 5 were repeated 40-times. 

The PCR products (670 and 905 bp) were purified by agarose 
gel electrophoresis (0.9% agarose), followed by gel extraction 
30 (QIAEX II Gel Extraction Kit, Qiagen, Hilden, Germany) . The 
purified DNA fragments were used for the PCR reaction c. 



PCR reaction a: 



5 



10 PCR reaction b: 



15 
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PCR reaction c: 6 /xl PCR product of reaction a »50 ng) 

6 /xl PCR product of reaction b «50 ng) 
2 /xl primer CP- a (10 pmol/ml) 
5 2 /xl primer CP-e (10 pmol/ml) 

10, 0 /xl PCR buffer 

0.75 /xl polymerase mixture (2.6 U) 
73.25 /xl H 2 0 



io Reaction conditions for PCR reaction c: 
step 1 2 min - 94 °C 

step 2 30 sec - 94°C 

step 3 30 sec - 55°C 

step 4 1 min - 72°C 



15 



Steps 2 to 4 were repeated 31-times. 



The resulting PCR product (1.4 kb) was purified as 
mentioned above, digested with EcoRI, and ligated in an EcoRI - 

20 digested and dephosphorylated pBsk (-) -vector (Stratagene, La 
Jolla, CA, USA) . 1 /xl of the ligation mixture was used to 
transform E. coli XL-1 competent cells (Stratagene, La Jolla, 
CA, USA) . All standard procedures were carried out as described 
by Sambrook et al . (1987). The DNA sequence of the constructed 

25 gene (fcplO) was checked by sequencing as known in the art. 
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Example 3 

Increasing the thermostability of consensus phytase-1 by 
introduction of single mutations suggested by the amino 
acid sequences of consensus phytase-10 and consensus 
5 phytase-11 

In order to increase the thermostability of homologous 
genes, it is also possible to test the stability effect of each 
differing amino acid residue between the protein of interest and 

10 the calculated consensus sequence and to combine all stabilizing 
mutations into the protein of interest. We used the consensus 
phytase-1 as protein of interest and tested the effect on the 
protein stability of 34 amino acid residues that differ relative 
to consensus phytase-10 and/or -11 by single site-directed 

15 mutagenesis. 

To construct muteins for expression in A. niger, S. 
cerevisiae, or H. polymorpha, the corresponding expression 
plasmid containing the consensus phytase-1 gene was used as 
template for site-directed mutagenesis (see Examples 6-8) . 

20 Mutations were introduced using the "quick exchangeTM site- 
directed mutagenesis kit" from Stratagene (La Jolla, CA, USA) 
following the manufacturer's protocol and using the 
corresponding primers. All mutations made and the corresponding 
primers are summarized in Table 4 . Plasmids harboring the 

25 desired mutation were identified by DNA sequence analysis as 
known in the art . 
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Table 4 

Primers used for site-directed mutagenesis of consensus phytase- 
1 

Exchanged bases are highlighted in bold. The introduction of a 
5 restriction site is marked above the sequence. When a 
restriction site is written in parenthesis, the mentioned site 
was destroyed by introduction of the mutation. 



mutation Primer set 

10 Kpn I 

Q50T 5 7 -CACTTGTGGGGTACCTACTCTCCATACTTCTC-3 7 (SEQ ID NO : 41) 

5 7 -GAGAAGTATGGAGAGTAGGTACCCCACAAGTG-3 ' 



Y54F 5' -GGTCAATACTCTCCATTCTTCTCTTTGGAAG-3 7 (SEQ ID NO: 42) 

15 5 7 - CTTCCAAAGAGAAGAATGGAGAGTATTGACC - 3 ' 

E58A 5 7 -CATACTTCTCTTTGGCAGACGAATCTGC-3 7 (SEQ ID NO: 43) 

5 7 - GCAGATTCGTCTGCCAAAGAGAAGTATG- 3 7 

2 o Aa t II 

D69K 5 7 -CTCCAGACGTCrCCAAAGGACTGTAGAGTTAC-3 7 (SEQ ID NO: 44) 

5 7 -GTAACTCTACAGTCCTTTGGGACGTCTGGAG-3 7 

Aat II 

25D70G 5 7 - CTCCAGACGTCCCAGACGGCTGTAGAGTTAC - 3 7 (SEQ ID NO: 45) 

5 7 -GTAACTCTACAGCCGTCTGGGACGTCTGGAG-3 7 

K9 1A 5 7 - GATACC CAACTTCTTCTGCGTCTAAGGCTTACTCTG - 3 7 

(SEQ ID NO: 46) 

3 0 5 7 - CAGAGTAAGCCTTAGACGCAGAAGAAGTTGGGTATC - 3 ' 



Sea I 

A94K 5 7 -CTTCTAAGTCTAAGAAGTACTCTGCTTTG-3 7 (SEQ ID NO:47) 

5 7 -CAAAGCAGAGTACTTCTTAGACTTAGAAG-3 7 

35 

Al 0 1R 5 7 -GCTTACTCTGCTTTGATTGAACGGATTCAAAAGAACGCTAC - 3 7 

(SEQ ID NO: 48) 
5 7 -GTAGCGTTCTTTTGAATCCGTTCAATCAAAGCAGAGTAAGC - 3 7 



40 N134Q 5 7 -CCATTCGGTGAACAGCAAATGGTTAACTC-3 1 (SEQ ID NO: 49) 
5 7 -GAGTTAACCATTTGCTGTTCACCGAATGG- 3 7 
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Nru I 

K153N 5' -GATACAAGGCTCTCGCGAGAAkCATTGTTC -3 7 (SEQ ID NO: 50) 
5 ' -GGAACAATGTTTC7CGCGAGAGCCTTGTATC-3 ' 

5 Bss HI 

I158V 5 7 -GATTGTTCCATTCGTGCGCGCTTCTGGTTC-3 ' (SEQ ID NO : 51) 
5 ' -GAACCAGAAGCGCGCACGAATGGAACAATC-3 7 

Apa I 

10 S187A 5 7 -GGCTGACCCAGGGGCCCAACCACACCAAGC-3 7 (SEQ ID NO: 53) 
5' - GCTTGGTGTGGTT GGGCC CCTGGGTCAGC C - 3 ' 

Bel I 

D197N 5 7 -CTCCAGTTATTAACGTGATCATTCCAGAAGG-3 ' (SEQ ID NO: 52) 
15 5 7 - CCTTCTGGAATGATCACGTTAATAACTGGAG- 3 7 

NCO I 

T214L 5 ' -CACTTTGGACCATGGTCTTTGTACTGCTTTCG-3 ' (SEQ ID NO : 54) 
5 ' - CGAAAGCAGTACAAAGACCA TGGTCCAAAGTG- 3 ' 

20 

Avr II 

E222T 5 ' -GCTTTCGAAGACTCTACCCTAGGTGACGACGTTG- 3 ' 

(SEQ ID NO: 55) 

5 ' -CAACGTCGTCACCTAGGGTAGAGTCTTCGAAAGC-3 ' 

25 

V2 2 7 A 5 7 - GGTGACGACGCTGAAGCTAACTTCAC - 3' (SEQ ID NO: 56) 
5 7 -GTGAAGTTAGCTTCAGCGTCGTCACC-3 7 

Sac II 

30 L234V 5 7 -CTAACTTCACCGCGGTGTTCGCTCCAG-3 7 (SEQ ID NO: 57) 
5 ' - CTGGAGCGAACACCGCGGTGAAGTTAG- 3 7 

A2 3 8P 5 7 -GCTTTGTTCGCTCCACCTATTAGAGCTAGATTGG- 3 7 

(SEQ ID NO: 58) 

3 5 5 7 - CCAATCTAGCTCTAATAGGTGGAGCGAACAAAGC - 3 7 

Hpa I 

T251N 5 7 -GCCAGGTGTTAACTTGACTGACGAAG-3 7 (SEQ ID NO: 59) 
5 7 -TTCGTCAGTCAAGTTAACACCTGGC-3 7 



40 



Aat II 

Y2 5 9N 5 7 - GACGAAGACGTCGTTAACTTGATGGAC - 3 7 ( SEQ ID NO : 60) 
5 7 -GTCCATCAAGTTAACGACGTCTTCGTC-3 7 



45 

E267D 



Asp I 

5 7 -GTCCATTC GACA CTGTCGCTAGAACTTC- 3 7 ( SEQ ID NO : 61) 
5 7 - GAAGTTCTAGCGACAGTGTCGAATGGAC - 3 ' 
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E277Q 5 ' -CTGACGCTACTCAGCTGTCTCCATTC-3 7 (SEQ ID NO: 62) 

5 7 -GAATGGAGACAGCTGAGTAGCGTCAG-3 ' 

5 A283D 5 ' -GTCTCCATTCTGTGATTTGTTCACTCAC-3 7 (SEQ ID NO: 63) 
5 ' - GTGAGTGAACAAATCACAGAATGGAGAC - 3 ' 

Ksp I 

H287A 5 7 -GCTTTGTTCACCGCGGACGAATGGAG-3 7 (SEQ ID NO: 64) 
10 5 7 - CTCCATTCGTCCGCGGTGAACAAAGC - 3 ' 

Bam HI 

R2 91I 5 ' - CACGACGAAT GGATC CAATACGACTAC - 3 ' ( SEQ ID NO : 65) 

5 ' - GTAGTCGTATTGSA TCCATTCGTCGTG- 3 ' 

15 

Bsi WI 

Q292A 5 7 -GACGAATGGAGAGCGTACGACTACTTG- 3 ' (SEQ ID NO: 66) 
5 7 -CAAGTAGTCGTACGCTCTCCATTCGTC-3 ' 

2 o Hpa I 

A320V 5 ' -GGTGTTGGTTTCGTTAACGAATTGATTGC-3 ' (SEQ ID NO: 67) 
5 ' -GCAATCAATTCGTTAACGAAACCAACACC-3 ' 

(Bgl II) 

25 R329H 5 1 -GCTAGATTGACTCACTCTCCAGTTCAAG-3 ' (SEQ ID NO: 68) 
5 1 -CTTGAACTGGAGAGTGAGTCAATCTAGC-3 1 

Eco RV 

S364T 5 7 -CTCACGACAACACTATGATilTCTATTTTCTTC-3 ' (SEQ ID NO : 69) 

3 0 5 7 - GAAGAAAATA GATA TCATAGTGTTGTCGTGAG - 3 7 

Nco I 

I366V 5 7 -CGACAACT CCA TGGTTTCTATTTTCTTCGC- 3 7 (SEQ ID NO: 70) 
5 7 -GCGAAGAAAATAGAAACCATOGAGTTGTCG-3 7 

35 

Kpn I 

A379K 5 7 -GTACAACGGTACCAAGCCATTGTCTAC-3 ' (SEQ ID NO: 71) 
5 7 - GTAGACAATGGCTTGGTACCGTTGTAC - 3 7 

40 S396A 5 7 -CTGACGGTTACGCTGCTTCTTGGAC-3 7 (SEQ ID NO : 72) 
5 7 -GTCCAAGAAGCAGCGTAACCGTCAG-3 7 

G404A 5 7 -CTGTTCCATTCGCTGCTAGAGCTTAC-3 7 (SEQ ID NO: 73) 
5 7 - GTAAGCTCTAGCAGCGAATGGAACAG- 3 7 

45 

Q415E 5 7 -GATGCAATGTGAAGCTGAAAAGGAACC-3 7 (SEQ ID NO: 74) 

5 7 -GGTTCCTTTTCAGCTTCACATTGCATC-3 7 
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Sal I 

A437G 5' -CACGGTTGTGGTGTCGACAAGTTGGG-3' (SEQ ID NO: 75) 
5 ' - CCCAACTTGTCGACACCACAACCGTG- 3 ' 

5 

Mun I 

A463E 5 ' -GATCTGGTGGCAATTGGGAGGAATGTTTCG-3 ' (SEQ ID NO: 76) 

5 ' -CGAAACATTCCTCCCAATTGCCACCAGATC-3 ' 

io and, accordingly, for other mutations. 

The temperature optimum of the purified phytases, 
expressed in Saccharomyces cerevisiae (Example 7) , was 
determined as outlined in Example 9. Table 5 shows the effect of 
15 each mutation introduced on the stability of consensus phytase- 
1. 

Table 5 

Stability effect of the individual amino acid replacements in 

20 consensus phytase-1 

+ or - means a positive, respectively, negative effect on the 
protein stability up to 1°C, ++ and -- means a positive, 
respectively, negative effect on the protein stability between 1 
and 3°C; the numbers 10 or 11 in parentheses indicate the 

25 consensus phytase sequence that suggested the amino acid 
replacement . 
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We combined eight positive mutations (E58A, D197N, E267D, 
R291I, R329H, S364T, A379K, G404A) in consensus phytase-1 
5 thermo[8], using the primers and the technique mentioned above 
in this example. Furthermore, the mutations Q50T and/or K91A 
were introduced which mainly influence the catalytic 
characteristics of phytase (see patent applications EP 897010 
and EP 897985, as well as Example 9) . The DNA and amino acid 

io sequence of the resulting phytase (consensus phytase- 1- 
thermo [8] -Q50T-K91A) are shown in Figure 7. In this way, the 
temperature optimum and the melting point of the consensus 
phytase were increased by 7°C (Figures 15, 16, 17) . 

In a further consensus protein, we combined eleven 

15 positive mutations (E58A, D69K, D197N, T214L, E222T, E267D, 
R291I, R329H, S364T, A379K, G404A) in consensus phytase-1 
thermo[ll]. Furthermore, the mutations Q50T and/or K91A were 
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introduced. In this way, the melting temperature was increased 
by another 3-4°C when compared to consensus phytase-1 thermo[8] . 

Using the results of Table 5, we further improved the 
thermostability of consensus phytase-10 by the back mutations 
5 K94A, V158I, and A396S, the reverse of which (A94K, I158V, and 
S3 96A) revealed a strong negative influence on the stability of 
consensus phytase-1. The resulting protein was called consensus 
phytase-10-thermo [3] . SEQ ID NO: 26 plus the three mutations 
K94A, V158I, and A396S. Furthermore, we introduced the mutations 

io Q50T and K91A that mainly influence the catalytic 
characteristics of consensus phytase (see patent applications EP 
897010 and EP 897985, as well as Example 9 and Figures 14 and 
15) . The resulting DNA and amino acid sequence are shown in 
Figure 8. The optimized phytase showed a 4°C higher temperature 

is optimum and melting point than consensus phytase-10 (Figures 12 
and 13) . Furthermore, the phytase has also a strongly increased 
specific activity with phytate as substrate of 250 U/mg at pH 
5 . 5 (Figure 14) . 

In a still further consensus protein, two additional 

20 mutations were introduced into consensus phytase-10 thermo[3] 
(E222T, G437A) which yielded consensus phytase-10 thermo[5]. 
Furthermore, the mutations Q50T and/or K91A were introduced. In 
this way, the melting temperature was increased by another 1-2 °C 
when compared to consensus phytase-10 thermo[3]. 

25 
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Example 4 

Stabilization of the phytase of A. fumigatus ATCC 13073 by 
replacement of amino acid residues with the corresponding 
consensus phytase-1 and/or consensus phytase-10 residues 

5 

At six amino acid sequence positions where A. fumigatus 
13 0 73 phytase is the only or nearly the only phytase in the 
alignment of Figure 1 that does not contain the corresponding 
consensus phytase amino acid residue, the non-consensus amino 

10 acid residue was replaced by the consensus one. The following 
amino acids were substituted in A. fumigatus 13 073 phytase, 
containing additionally the Q51(24)T substitution (influencing 
the catalytic properties and corresponding to the Q50T 
substitution in the consensus phytases) and the signal sequence 

is of A. terreus cbsll6.46 phytase (see European Patent Application 
No. 0897010, and Figure 9): F55(28)Y / V100(73)I, F114(87)Y, 
A243(220)L, 8265(242)?, N294(282)D. The numbers in parentheses 
refer to the numbering in Figure 1. 

In a second round, four of the seven stabilizing amino 

20 acid exchanges (E58A, R329H, S364T, G404A) identified in 
consensus phytase-10 and tested as single mutations in consensus 
phytase-1 (Table 5) were additionally introduced into the A. 
fumigatus alpha-mutant. Furthermore, the amino acid replacement 
S154N, shown to reduce the protease susceptibility of the 

25 phytase, was introduced. 

The mutations were introduced as described in Example 3 
(see Table 6) and expressed as described in Examples 6 to 8 . The 
resulting A. fumigatus 13 073 phytase variants were called alpha- 
mutant (i.e. the A. fumigatus ATCC 13073 phytase with the 

30 substitutions Q24T, F28Y, V73I, F87Y, A220L, S242P, N282D) and 
* x optimized 7 ' alpha-mutant (i.e. the A. fumigatus alpha-mutant 
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having the additional substitutions E59A-S154N-R329H-S364T- 
G4 04A) . K92A is an additional preferred mutation. 

The temperature optimum (60 °C, Figure 2 0) and the melting 
temperature (67.0°C, Figure 19) of the A. fumigatus 13073 alpha- 
5 mutant phytase were increased by 5-7°C in comparison to the 
values of the wild-type phytase (temperature optimum: 55°C, Tm: 
60°C) . The five additional amino acid replacements further 
increased the temperature optimum by 3°C (Figure 20) . 

10 Table 6 

Mutagenesis primers for the stabilization of A, fumigatus ATCC 
13073 phytase 



Mutation 
15 F55Y 



Primer 

5 7 -CACGTACTCGCCATACTTTTCGCTCGAG-3 ' (SEQ ID NO: 77) 
5 ' - CTCGAGCGAAAAGTATGGCGAGTACGTG- 3 ' 



E58A 



20 



{Xho I) 

5' - CCATACTTTTCG CTCGC GGACGAGCTGTCCGTG- 3 ' 

(SEQ ID NO: 78) 

5' -CACGGACAGCTCGTCCGCGAGCGAAAAGTAGG-3 ' 



V100I 



25 



5 ' -GTATAAGAAGCTTATTACGGCGATCCAGGCC-3 ' 



5' -GGCCTGGATCGCCGTAATAAGCTTCTTATAC-3 ' 



(SEQ ID NO: 79) 



30 



F114Y 



A243L 



5' - CTTCAAGGGCAAGTACGCCTTTTTGAAGACG- 3 ' 



5 ' -CGTCTTCAAAAAGGCGTACTTGCCCTTGAAG-3 1 



(SEQ ID NO: 80) 



5 ' - CATCCGAGCTCGCCTCGAGAAGCATCTTC - 3 ' ( SEQ ID NO : 81 ) 
5 ' - GAAGATGCTTCTCGAGGCGAGCTCGGATG - 3 ' 



S265P 



35 



5 ' - CTAATGGA TGTGTCCGTTTGATACGGTAG- 3 ' (SEQ ID NO: 82) 
5 ' - CTACCGTATCAAACGGACACATGTCCATTAG- 3 ' 



N2 94D 



5 ' - GTGGAAGAAGTACGACTACCTTCAGTC - 3 ' ( SEQ ID NO : 83) 
5 1 -GACTGAAGGTAGTCGTACTTCTTCCAC-3 ' 
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(Aflu I) 

R329H 5 ' -GCCCGGTTGACGCATTCGCCAGTGCAGG-3 ' (SEQ ID NO: 84) 

5 ' - CCTGCACTGGCGAATGCGTCAACCGGGC - 3 ' 

5 Nco I 

S364T 5 7 -CACACGACAACACCATGGTTTCCATCTTC-3 ' (SEQ ID NO: 85) 

5 ' -GAAGATGGAAACCATGGTGTTGTCGTGTG-3 ' 

(Bss HI) 

10 G404A 5' -GTGGTGCCTTTCGCCGCGCGAGCCTACTTC-3 ' (SEQ ID NO: 86) 

5 ' -GAAGTAGGCTCGCGCGGCGAAAGGCACCAC- 3 ' 



Example 5 

Introduction of the active site amino acid residues of A. 

15 niger NRRL 313 5 phytase into consensus phytase-1 

We used the crystal structure of Aspergillus niger 
NRRL 3135 phytase to define all active site amino acid residues 
(see Example 1, and EP 897010) . Using the alignment of Figure 1, 
we replaced the following active site residues and additionally 

20 the non- identical adjacent ones of consensus phytase-1 by those 
of A. niger phytase: 

S89D, S92G, A94K, D164S, P201S, G203A, G205S, H212P, 
G224A, D22 6T, E255T, D256E, V258T, P2 65S, Q2 92H, G3 00K, Y3 05H, 
A314T, S364G, M365I, A397S, S398A, G404A, and A405S. 

25 The new consensus phytase- 7 protein sequence was 

backtranslated into a DNA sequence (Figure 10) as described in 
Example 1. The corresponding gene (fcp7) was generated as 
described in Example 1 using the following oligonucleotide 
mixes : 

30 Mix 1.7: CP-1, CP-2, CP-3, CP-4.7, CP-5.7, CP-6, CP-7, CP- 

8.7, CP-9, CP-10.7 



Mix 2.7: 

35 



CP-9, CP-10.7, CP-11.7, CP-12.7, CP-13.7, CP-14.7, 
CP-15.7, CP-16, CP-17.7, CP-18.7, CP-19.7, CP-20, 
CP-21, CP-22. 
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The DNA sequences of the oligonucleotides are indicated in 
Figure 10. The newly synthesized oligonucleotides are 
additionally marked by the number 7. After assembling of the 
5 oligonucleotides using the same PCR primers as mentioned in 
Example 1, the gene was cloned into an expression vector as 
described in Examples 6-8. 

The pH-profile of the enzyme determined after expression 
in H. polymorpha and purification was very similar to that of A. 
10 niger phytase (see Figure 18) . 

Example 6 

Expression of the consensus phytase genes in Hansenula 
polymorpha 

15 The phytase expression vectors used to transform H. polymorpha 
RB11 [Gellissen, G., Hollenberg, C. P., Janowicz, Z. A. (1994) 
Gene expression in methylotrophic yeasts, in Smith, A. (ed.) 
Gene expression in recombinant microorganisms. Dekker, New York, 
pp. 395-439] were constructed by inserting the Eco RI fragment 

20 of pBsk-fcp or variants thereof into the multiple cloning site 
of the H. polymorpha expression vector pFPMT121, which is based 
on an ura3 selection marker from S. cerevisiae, a formate dehy- 
drogenase (FMD) promoter element and a methanol oxidase (MO) 
terminator element from H. polymorpha. The 5' end of the fcp 

25 gene is fused to the FMD promoter, the 3' end to the MOX termi- 
nator (Gellissen et al . , Appl . Microbiol. Biotechnol . 46, 46-54, 
1996; EP 299108) . The resulting expression vectors are desig- 
nated pFPMTfcp, pFPMTfcplO, and pFPMTf cp7 . 

The constructed plasmids were propagated in E. coli. 
30 Plasmid DNA was purified using standard state of the art 
procedures. The expression plasmids were transformed into the H. 
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polymorpha strain RB11 deficient in orotidine- 5 7 -phosphate 
decarboxylase (ura3) using the procedure for preparation of 
competent cells and for transformation of yeast as described in 
Gellissen et al . (1996). Each transformation mixture was plated 
5 on YNB medium (0.14% w/v Difco YNB and 0.5% ammonium sulfate) 
containing 2% glucose and 1.8% agar, and incubated at 3 7 °C. 
After 4 to 5 days individual transformant colonies were picked 
and grown in the liquid medium described above for 2 days at 3 7 
°C. Subsequently, an aliquot of this culture was used to 

10 inoculate fresh vials with YNB-medium containing 2% glucose. 
After seven further passages in selective medium, the expression 
vector had integrated into the yeast genome in multimeric form. 
Subsequently, mitotically stable transf ormants were obtained by 
two additional cultivation steps in 3 ml non-selective liquid 

is medium (YPD, 2% glucose, 10 g/1 yeast extract, and 2 0 g/1 
peptone) . In order to obtain genetically homogeneous recombinant 
strains, an aliquot from the last stabilization culture was 
plated on a selective plate . Single colonies were isolated for 
analysis of phytase expression in YNB containing 2% glycerol 

20 instead of glucose to derepress the FMD promoter . Purification 
of the consensus phytases was done as described in Example 7 . 

Example 7 

Expression of the consensus phytase genes in Saccharomyces 
2 5 cerevisiae and purification of the phytases from the 

culture supernatant 

The consensus phytase genes were isolated from the 
corresponding Bluescript-plasmid (pBsk-fcp, pBSK-fcplO, pBsk- 
30 fcp7) and ligated into the Eco RI sites of the expression 
cassette of the Saccharomyces cerevisiae expression vector pYES2 
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(Invitrogen, San Diego , CA, USA) or subcloned between the 
shortened GAPFL (glyceraldhyde- 3 -phosphate dehydrogenase) 
promoter and the pho5 terminator as described by Janes et al . , 
Curr. Genet. 18, 97-103. The correct orientation of the gene was 
5 checked by PCR. Transformation of S. cerevisiae strains, e.g. 
INVScl (Invitrogen, San Diego, CA, USA) , was done according to 
Hinnen et al . , Proc . Natl. Acad. Sci. USA 75, 1929-1933 (1978). 
Single colonies harboring the phytase gene under the control of 
the GAPFL promoter were picked and cultivated in 5 ml selection 

10 medium [SD-uracil; Sherman, J. P., Finck, G. R. & Hicks, J. B. 
(1986) Laboratory course manual for methods in yeast genetics . 
Cold Spring Harbor University] at 30 °C under vigorous shaking 
(2 5 0 rpm) for one day. The preculture was then added to 500 ml 
YPD medium (Sherman et al . , 198 6) and grown under the same 

15 conditions. Induction of the gall promoter was done according to 
the manufacturer's instructions. After four days of incubation, 
the cell broth was centrifuged (7000 rpm, GS3 rotor, 15 min, 
5°C) to remove the cells , and the supernatant was concentrated 
by way of ultrafiltration in Amicon 84 0 0 cells (PM3 0 membranes ; 

20 Grace AG, Wallizeller, Switzerland) and ultrafree-15 centrifugal 
filter devices (Biomax-30K, Millipore, Bedford, MA, USA) . The 
concentrate (10 ml) was desalted on a 4 0 ml Sephadex G2 5 
Superfine column (Pharmacia Biotech, Freiburg, Germany) , with 10 
mM sodium acetate, pH 5.0, serving as elution buffer. The 

25 desalted sample was brought to 2 M (NH 4 ) 2 S0 4 and directly loaded 
onto a 1 ml Butyl Sepharose 4 Fast Flow hydrophobic interaction 
chromatography column (Pharmacia Biotech, Feiburg, Germany) 
which was eluted with a linear gradient from 2 M to 0 M (NH 4 ) 2 S0 4 
in 10 mM sodium acetate, pH 5.0. Phytase was eluted in the 

30 breakthrough, concentrated and loaded on a 120 ml Sephacryl S- 
300 gel permeation chromatography column (Pharmacia Biotech, 
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Freiburg , Germany) . Consensus phytases -1, -7 and -10 eluted as 
a homogeneous symmetrical peak and were shown by SDS-PAGE to be 
approx. 95% pure. 



5 Example 8 

Expression of the consensus phytase genes in Aspergillus 
niger 

The Bluescript-plasmids pBsk-fcp, pBsk-fcplO, and pBsk- 
fcp7 were used as template for the introduction of a Bsp HI -site 
10 upstream of the start codon of the genes and an Eco RV-site 
downstream of the stop codon. The ExpandTM High Fidelity PCR Kit 
(Boehringer Mannheim, Mannheim, Germany) was used with the 
following primers: 

15 Primer Asp-1: 

Bsp HI 

5 ' -TATATCATGAGCGTGTTCGTCGTGCTACTGTTC-3 ! (SEQ ID NO: 87) 



Primer Asp-2 used for cloning of fcp and f cp7 : 

20 

Eco RV 

3 ' -ACCCGACTTACAAAGCGAATTCTATAGATATAT-5 ' (SEQ ID NO : 88) 



Primer Asp-3 used for cloning of fcplO: 

25 

Eco RV 

3 ' -ACCCTTCTTACAAAGCGAATTCTATAGATATAT-5' (SEQ ID NO: 89) 



The reaction was performed as described by the supplier. 
30 The PCR-amplif ied fcp-genes had a new Bsp HI site at the start 
codon, introduced by primer Asp-1, which resulted in a 
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replacement of the second amino acid residue glycine by serine. 
Subsequently, the DNA- fragment was digested with Bsp HI and Eco 
RV and ligated into the Nco I site downstream of the 
glucoamylase promoter of Aspergillus niger (glaA) and the Eco RV 
5 site upstream of the Aspergillus nidulans tryptophan C 
terminator (trpC) (Mullaney et al . , 1985). After this cloning 
step, the genes were sequenced to detect possible errors 
introduced by PCR. The resulting expression plasmids, which 
basically correspond to the pGLAC vector as described in Example 
io 9 of EP 684313, contained the orotidine- 5 ' -phosphate 
decarboxylase gene (pyr4) of Neurospora crassa as a selection 
marker. Transformation of Aspergillus niger and expression of 
the consensus phytase genes was done as described in EP 684313. 
The consensus phytases were purified as described in Example 7. 

15 

Example 9 

Determination of phytase activity and of the pH and 
temperature optima 

This example relates i.a. to the determination of phytase 
20 activity and of the temperature optimum. Various phytases have 
been tested. 

The phytase of Aspergillus niger NRRL 3135 was prepared as 
described in EP 420358 and by van Hartingsveldt et al . (Gene 
127, 87-94, 1993) . 

25 The phytases of Aspergillus fumigatus ATCC 13073, 

Aspergillus terreus 9A-1, Aspergillus terreus cbsll6.46, 
Emericella nidulans, Myceliophthora thermophila, and 
Talaromyces thermophilus were prepared as described in 
EP-0897985 and in the references therein. 

3 0 The remaining phytases tested were prepared as described 

herein . 
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Consensus phytase-1- thermo (8) designates a variant of 
consensus phytase-1, which further comprises the eight mutations 
which are underlined in the legend to Figure 5. Consensus 
phytase-1 is shown in Fig. 1 (SEQ ID NO: 14) without signal 
5 peptide, and in Fig. 2 (SEQ ID NO: 16) with the signal peptide. 

Phytase activity was determined basically as described by 
Mitchell et al . (1997). The activity was measured in an assay 
mixture containing 0.5% phytic acid («5 mM) in 200 mM sodium 
acetate, pH 5.0. After 15 min of incubation at 37°C / the reac- 

10 tion was stopped by addition of an equal volume of 15% tri- 
chloroacetic acid. The liberated inorganic phosphate was quanti- 
fied by mixing 100 fil of the assay mixture with 900 fil H 2 0 and 1 
ml of 0.6 M H 2 S0 4/ 2% ascorbic acid and 0.5% ammonium molybdate. 
Standard solutions of potassium phosphate were used as refer- 

15 ence. One unit of enzyme activity was defined as the amount of 
enzyme that releases 1 ^imol phosphate per minute at 37°C. The 
protein concentration was determined using the enzyme extinction 
coefficient at 280 nm calculated according to Pace et al . [Pace 
N. C, Vajdos, F., Fee, L., Grimsley, G. & Gray, T. (1995) How 

20 to measure and predict the molar absorption coefficient of a 
protein. Prot . Sci . 4, 2411-2423]: 1 absorption unit (1 OD) at 
280 nm corresponds to 1.101 mg/ml of consensus phytase-1, 1.068 
mg/ml of consensus phytase-7, and 1.039 mg/ml of consensus phy- 
tase-10 . 

25 In case of pH-optimum curves, the purified enzymes were 

diluted in 10 mM sodium acetate, pH 5.0. Incubations were 
started by mixing aliquots of the diluted protein with an equal 
volume of 1% phytic acid («10 mM) in a series of different 
buffers: 0.4 M glycine/HCl, pH 2.5; 0.4 M acetate/NaOH, pH 3.0, 

30 3.5, 4.0, 4.5, 5.0, 5.5; 0.4 M imidazole/HCl , pH 6.0, 6.5; 0.4 M 
Tris/HCl pH 7.0, 7.5, 8.0, 8.5, 9.0. Control experiments showed 
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that pH was only slightly affected by the mixing step. 
Incubations were performed for 15 min at 3 7°C as described 
above . 

For determination of the substrate specificities of the 
5 phytases, phytic acid in the assay mixture was replaced by 5 mM 
concentrations of the respective phosphate compounds. Besides, 
the activity tests were performed as described above. 

For determination of the temperature optimum, enzyme (100 
pi) and substrate solution (100 /xl) were pre-incubated for 5 min 
10 at the given temperature . The reaction was started by addition 
of the substrate solution to the enzyme. After 15 min of 
incubation, the reaction was stopped with trichloroacetic acid, 
and the amount of phosphate released was determined. 

The pH-optimum of consensus phytase-1 was around pH 6.0- 
15 6.5 (70 U/mg) . Introduction of the Q50T mutation shifted the 
pH-optimum to pH 6.0 (130 U/mg). Introduction of the K91A 
mutation further shifted the pH optimum into the more acidic pH- 
range . Comparable effects of the Q50T and K91A mutations were 
also observed for consensus phytase-10 and for further 
20 stabilized consensus phytase variants (Figures 14 and 15) . 

Consensus phytase-7, which was constructed to transfer the 
catalytic characteristics of A. niger NRRL 3135 phytase to 
consensus phytase-1, had a pH-profile very similar to that of A. 
niger NRRL 3135 phytase (see Figure 18) . The substrate 
25 specificity also resembled more that of A. niger NRRL 3135 
phytase than that of consensus phytase-1. 

The temperature optimum of consensus phytase-1 (71°C) was 
16-26°C higher than the temperature optima of the wild-type 
phytases (45-55°C, Table 7) that were used to calculate the 
3 0 consensus sequence. The improved consensus phytase-10 showed a 
further increase of its temperature optimum to 80 °C (Figure 13) . 
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The temperature optimum of consensus phytase- 1- thermo [8] 
was found to be in the same range (78 °C) when using the 
supernatant of an overproducing S. cerevisiae strain. The 
highest temperature optimum reached of 82 °C was determined for 
5 consensus phytase- 10 -thermo [3] -Q50T-K91A. Table 7 



Temperature optima and Tm-values of consensus phytase and 
of the phytases from A. fumigatus, A. niger, E. nidulans, 
and M. thermophila. 



The determination of the temperature optimum was performed 
as described in Example 9 . The Tm-values were determined by 
differential scanning calorimetry as described in Example 10. 



Phytase 


Optimum temperature 
<°C) 


Tm (°C) 


Aspergillus niger 
NRRL 313 5 


55 


63 .3 


Aspergillus 
fumigatus ATCC 13 073 


55 


62 . 5 


Aspergillus terreus 
9A-1 


49 


57 .5 


Aspergillus terreus 
cbsll6.46 


45 


58 .5 


Emericella nidulans 


45 


55 .7 


Myceliophthora 
thermophila 


55 




Talaromyces 
thermophi lus 


45 




Consensus phytase- 




90.4 


10-thermo [5] -Q50T- 






K91A 






Consensus -phytase- 
10-thermo [3] -Q50T- 
K91A 


82 


89.3 
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Consensus -phytase - 
10-thermo [3] -Q50T 


82 


88.6 


Consensus -phytase- 10 


80 


85 .4 


Consensus phytase- 1- 
unermo L -L J- J ~ v ± — r^z? ±£\ 


- 


88 . 0 


Consensus phytase - 1 - 
unermo L-L-LJ -you± 


- 


88.5 


Consensus -phytase - 1 - 
tnermo lo } ~su-> u J- ~ J--^- 


- 


85 . 7 


Consensus -phytase - 1 - 

U. ne 27TT10 [o J yDUl 


78 


84 .7 


Consensus -phytase - 1 - 
thermo [8] 


81 


- 


Consensus -phytase - 1 - 
thermo [3] 


75 


- 


Consensus -phytase- 1 - 
Q50T 




78 . 9 


Consensus -phytase - 1 


71 


78 . 1 


Aspergillus 
f umigatus ot- mutant 
Q51T 


60 


67.0 


Aspergillus 
f umigatus a-mutant , 
plus mutations E59A, 

blb4JM / KizyH, bjo41, 
G4 04A 


63 


- 


Aspergillus 
f umigatus 
'^optimized' ' alpha- 
mutant , plus 
mutation K92A 


63 
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Example 10 

Determination of the melting temperature by differential 
scanning calorimetry (DSC) 
5 In order to determine the unfolding temperature of the 

phytases, differential scanning calorimetry was applied as de- 
scribed by Brugger et al . , 1997 [Brugger, R. , Mascarello, F., 
Augem, S., van Loon, A. P. G. M. & Wyss, M. (1997). Thermal de- 
naturation of phytases and pH 2.5 acid phosphatase studied by 
10 differential scanning calorimetry. In The Biochemistry of phy- 
tate and phytase (eds. Rasmussen, S.K.; Raboy, V.; Dalboge, H. 
and Loewus, F.; Kluwer Academic Publishers, Dordrecht, the Neth- 
erlands] . Solutions of 50-60 mg/ml of homogeneous phytase were 
used for the tests. A constant heating rate of 10°C/min was ap- 
15 plied up to 90-95°C. 

The determined melting points confirm the results obtained 
for the temperature optima (Table 7) . The most stable consensus 
phytase designed so far is consensus phytase- 10 -thermo [3] -Q50T- 
K91A showing a melting temperature under the chosen conditions 
20 of 89.3°C. This is 26.0 to 33.6°C higher than the melting 
temperature of the wild- type phytases used. 

Example 11 

Transfer of basidiomycete phytase active site into 
25 consensus phytase- 10 -thermo [3] -Q50T-K91A 

As described previously (Example 5) , mutations derived 
from the basidiomycete phytase active sites were introduced into 
consensus phytase-10. The following five constructs a) to e) 
were prepared : 

30 a) The construct called consensus phytase-12, and it 

comprises a selected number of active site residues of the 
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basidio consensus sequence. Its amino acid sequence is shown in 
Fig. 21 (the first 26 amino acids form the signal peptide; 
positions differing from consensus phytase- 10 -thermo [3] -Q50T- 
K91A are underlined) ; 
5 b) a cluster of mutations (Cluster II) was transferred to 

the consensus phytase-1 and -10 sequences, viz.: S80Q, Y86F, 
S90G, K91A, S92A, K93T, A94R, Y95I; 

c) in a similar way, another cluster of mutations (Cluster 
III) was transferred, viz.: T129V, E133A, Q134N, M136S, V137S, 

10 N138Q, S139A; 

d) in a similar way, a further cluster of mutations 
(Cluster IV) was transferred, viz.: A168D, E171T, K172N, F173W; 

e) and finally, a further cluster of mutations (Cluster V) 
was transferred, viz.: Q297G, S298D, G300D, Y305T. 

is These constructs were expressed as described in Examples 6 

to 8. 

Example 12 

Phytase alignment using GAP 
20 The phytases described herein - i.e. the amino acid 

sequences as well as the corresponding DNA sequences - were 
aligned against each other. Also some other phytases were 
correspondingly aligned, viz. the following: 

- the consensus phytase-1 described in EP 897985; 

25 - the phytase derived from Aspergillus niger (ficuum) NRRL 313 5 
(A. niger NRRL3135) described in EP 420358; 

- the phytases derived from Aspergillus fumigatus ATCC 13073 (A. 
fumigatus 13073) ; Aspergillus fumigatus ATCC 32239 (A. 
fumigatus 32239); Aspergillus terreus cbsll6.46 (A.terreus cbs) ; 

30 Emericella nidulans (E. nidulans) ; and Talaromyces thermophilus 
(T. thermophilus) - all described in EP 897010; 
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the phytases derived from Myceliophthora thermophila (M. 
thermophila) ; and Aspergillus terreus 9-A1 (A. terreus 9-A1) - 
both described in EP 684313; 

the phytase derived from Thermomyces lanuginosus (T. 
5 lanuginosus) described in WO 9735017 (PCT/US97/04559) ; 

- the phytases derived from Agrocybe pediades (A. pediades) , 
Paxillus involutus 1 and 2 (P. involutus phyAl and phyA2) ; and 
Trametes pubescens (T. pubescens) - all described in WO 
98/28409; and 

10 - the phytase derived from Peniophora lycii (P. lycii) described 
in WO 98/28408. 

For the alignments, the program GAP was used with the 
settings as described above. 

For polypeptide comparisons, the signal peptides were 
15 included with the exception of comparisons with consensus 
phytase-11 . 

The results of the amino acid sequence comparisons are 
shown in Table 8 below. The first number in each cell is the 
amino acid similarity, the second number is the amino acid 
20 identity. 

For DNA sequence comparisons, the signal sequence was 
always included. The results are shown in Table 9 below. 

This invention comprises e.g. the following embodiments 
(A) to (J) that are described below. 

25 In these embodiments, when determining % identity or % 

similarity at the amino acid level for another phytase, its 
amino acid sequence is aligned with the reference sequence (e.g. 
in embodiment (A) the consensus phytase- 10 amino acid sequence) , 
using an alignment program such as GAP referred to above. 

30 Percentage identity, as well as percentage similarity, is 
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calculated by the program. The amino acid sequence of the other 
phytase may or may not include the signal peptide. 

When determining % identity on the DNA level for another 
phytase -encoding DNA, this DNA sequence is aligned with the 
5 reference sequence [e.g. in embodiment (A) nucleotides 12-1412 
of SEQ ID NO: 2 5 (the DNA sequence of consensus phytase -10 
(FcplO) as shown in Fig. 5] , using an alignment program such as 
GAP referred to above. Percentage identity is calculated by the 
program. The DNA sequence encoding the other phytase can be a 
10 genomic DNA sequence including introns, or it can be a cDNA 
sequence- It may or may not include the signal peptide-encoding 
part . 

When determining hybridization, the probe to be used is 
the specified DNA sequence [e.g. in embodiment (A) nucleotides 

15 12-1412 of SEQ ID NO: 25 (the DNA sequence of consensus phytase- 
10 (FcplO) as shown in Fig. 5) ] . The DNA sequence encoding the 
other phytase can be a genomic DNA sample which contains a 
phytase -encoding DNA-sequence; a purified genomic DNA sequence 
(purified with respect to the phytase -encoding DNA sequence) ; or 

20 it can be a phytase -encoding cDNA sequence, preferably purified 
or amplified, e.g. PCR-amplif ied. The phytase -encoding DNA, 
whatever type, may or may not include the signal peptide- 
encoding part. Suitable hybridization conditions are referred to 
above . 

25 The term * x DNA sequence" includes such fragments or parts 

of the herein exemplified DNA sequences, as long as they are 
capable of encoding an active enzyme (e.g. phytase). 

The term 1X amino acid sequence' ' includes such fragments 
or parts of the herein exemplified amino acid sequences, as long 

30 as they are enzymatically active (e.g. displaying phytase 
activity) . 
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(A) Phytases and corresponding DNA sequences related to 
consensus phytase-10 (CP10, Fcp 10) 

A phytase that comprises an amino acid sequence which is 
at least 93.80%; or at least 94, 94.5, 95, 95.5, 96, 96.5, 97, 
5 97.5, 98, 98.5, 99, 99.5% identical to the sequence of amino 
acids 1-467 of consensus phytase-10 (FcplO) as shown in Fig. 5. 

A phytase that comprises an amino acid sequence which is 
at least 95.09%; or at least 95.5, 96, 96.5, 97, 97.5, 98, 98.5, 
99, 99.5% similar to the sequence of amino acids 1-467 of 
10 consensus phytase-10. 

A phytase which is encoded by a DNA sequence which is at 
least 95.88%; or at least 96, 96.5, 97, 97.5, 98, 98.5, 99, 
99.5% identical to nucleotides 12-1412 of the DNA sequence of 
consensus phytase-10 (FcplO) as shown in Fig. 5. 
15 A DNA sequence which encodes a phytase and which (i) is at 

least 95.88%; or at least 96, 96.5, 97, 97.5, 98, 98.5, 99, 
99.5% identical; or (ii) hybridizes under low, or medium, 
medium/high, high, or very high stringency conditions to 
nucleotides 12-1412 of the DNA sequence of consensus phytase-10 
20 (FcplO) as shown in Fig . 5 . A suitable negative control is DNA 
encoding consensus phytase- 1. A suitable positive control is DNA 
encoding any of CP10, CP10 - thermo [3] -Q50T, K91A, CPl-thermo [8] , 
CPl-thermo [8] Q50T,K91A. 

A DNA sequence which encodes a phytase comprising an amino 
25 acid sequence which is at least 93.80%; or at least 94, 94.5, 
95, 95.5, 96, 96.5, 97, 97.5, 98, 98.5, 99, 99.5% identical to 
the sequence of amino acids 1-467 of consensus phytase-10 
(FcplO) as shown in Fig. 5. 
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(B) Phytases and corresponding DNA sequences related to 
consensus phytase- 10 -thermo [3] -Q50T-K91A 

A phytase which comprises an amino acid sequence which is 
at least 93.37%; or at least 93.5, 94, 94.5, 95, 95.5, 96, 96.5, 
5 97, 97.5, 98, 98.5, 99, 99.5% identical to the sequence of amino 
acids 1-467 of consensus phytase- 10 -thermo [3] -Q50T-K91A as shown 
in Fig. 8 . 

A phytase which comprises an amino acid sequence which is 
at least 94.66%; or at least 95.0, 95.5, 96, 96.5, 97, 97.5, 98, 

10 98.5, 99, 99.5% similar to the sequence of amino acids 1-467 of 
consensus phytase- 10 -thermo [3] -Q50T-K91A as shown in Fig. 8. 

A phytase which is encoded by a DNA sequence which is at 
least 95.88%; or at least 96, 96.5, 97, 97.5, 98, 98.5, 99, 
99.5% identical to nucleotides 12-1412 of the DNA sequence of 

15 consensus phytase-10- thermo [3 ] -Q50T-K91A as shown in Fig. 8. 

A DNA sequence which encodes a phytase and which (i) is at 
least 95.88%; or at least 96, 96.5, 97, 97.5, 98, 98.5, 99, 
99.5% identical; or (ii) hybridizes under low, or medium, 
medium/high, high, or very high stringency conditions to 

20 nucleotides 12-1412 of the DNA sequence of consensus phytase-10- 
thermo [3] -Q50T-K91A as shown in Fig. 8. A suitable negative 
control is DNA encoding consensus phytase- 1. A suitable positive 
control is DNA encoding any of CP10, CPlO-thermo [3] -Q50T-K91A, 
CPl-thermo [8] , or CPl-thermo [8] -Q50T-K91A. 

25 A DNA sequence which encodes a phytase comprising an amino 

acid sequence which is at least 93.37%; or at least 93.5, 94, 
94.5, 95, 95.5, 96, 96.5, 97, 97.5, 98, 98.5, 99, 99.5% 
identical to the sequence of amino acids 1-467 of consensus 
phytase- 10 -thermo [3] -Q50T-K91A as shown in Fig. 8. 

30 



5808. 200 -US/140100/KaPe 



62 

(C) Phytases and corresponding DNA sequences related to 
consensus phytase-l-thermo [8] 

A phytase which comprises an amino acid sequence which is 
at least 98.30%; or at least 98.5, 99, 99.5% identical to the 
5 sequence of amino acids 1-467 of consensus phytase- 1-thermo [8] 
(as shown in Fig. 7; backmutations T50Q and A91K to be added) . 

A phytase which comprises an amino acid sequence which is 
at least 98.51%; or at least 99, 99.5% similar to the sequence 
of amino acids 1-467 of consensus phytase-l-thermo [8] (as shown 

io in Fig. 7; backmutations T50Q and A91K to be added) . 

A phytase which is encoded by a DNA sequence which is at 
least 98.73%; or at least 99, 99.5% identical to nucleotides 1- 
1407 of the DNA sequence of consensus phytase- 1-thermo [8] (as 
shown in Fig. 7; backmutations T50Q and A91K to be added) . 

15 A DNA sequence which encodes a phytase and which (i) is at 

least 98.73%; or at least 99, 99.5% identical; or (ii) 
hybridizes under low, or medium, medium/high, high, or very high 
stringency conditions to nucleotides 1-14 07 of the DNA sequence 
of consensus phytase-l-thermo [8] (as shown in Fig. 7; 

20 backmutations T50Q and A91K to be added) . A suitable negative 
control is DNA encoding consensus phytase- 1. A suitable positive 
control is DNA encoding any of CPl-thermo [8] , CP1 -thermo [8] - 
Q50T-K91A. 

A DNA sequence which encodes a phytase comprising an amino 
25 acid sequence which is at least 98.30%; or at least 98.5, 99, 
99.5% identical to the sequence of amino acids 1-467 of 
consensus phytase-l-thermo [8] (as shown in Fig. 7; backmutations 
T50Q and A91K to be added) . 
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(D) Phytases and corresponding DNA sequences related to 
consensus phytase-l-thermo [8] -Q50T-K91A 

A phytase which comprises an amino acid sequence which is 
at least 97.87%; or at least 98, 98.5, 99, 99.5% identical to 
5 the sequence of amino acids 1-4 67 of consensus phytase-l- 
thermo [8] -Q50T-K91A as shown in Fig. 7. 

A phytase which comprises an amino acid sequence which is 
at least 98.08%; or at least 98.5, 99, 99.5% similar to the 
sequence of amino acids 1-467 of consensus phytase-l-thermo [8] - 
10 Q50T-K91A as shown in Fig. 7. 

A phytase which is encoded by a DNA sequence which is at 
least 98.37%; or at least 98.5, 99, 99.5% identical to 
nucleotides 1-1407 of the DNA sequence of consensus phytase-l- 
thermo [8] -Q50T-K91A as shown in Fig. 7. 
15 A DNA sequence which encodes a phytase and which (i) is at 

least 98.37%; or at least 98.5, 99, 99.5% identical; or (ii) 
hybridizes under low, or medium, medium/high, high, or very high 
stringency conditions to nucleotides 1-1407 of the DNA sequence 
of consensus phytase-l-thermo [8] -Q50T-K91A as shown in Fig. 7. A 
20 suitable negative control is DNA encoding consensus phytase- 1. A 
suitable positive control is DNA encoding any of CPl-thermo [8] , 
CPl-thermo [8] -Q50T-K91A. 

A DNA sequence which encodes a phytase comprising an amino 
acid sequence which is at least 97.87%; or at least 98, 98.5, 
25 99, 99.5% identical to the sequence of amino acids 1-467 of 
consensus phytase 1-thermo [8] -Q50T-K91A as shown in Fig. 7. 

(E) Phytases and corresponding DNA sequences related to 
consensus phytase - 1 1 

30 A phytase that comprises an amino acid sequence which is 

at least 90.71%; or at least 91, 91.5, 92, 92.5, 93, 93.5, 94, 
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94.5, 95, 95.5, 96, 96.5, 97, 97.5, 98, 98.5, 99, 99.5% 
identical to the sequence of amino acids 1-482 of consensus 
phytase-11 as shown in Fig. 6. 

A phytase that comprises an amino acid sequence which is 
5 at least 92.07%; or at least 92.5, 93, 93.5, 94, 94.5, 95, 95.5, 
96, 96.5, 97, 97.5, 98, 98.5, 99, 99.5% similar to the sequence 
of amino acids 1-482 of consensus phytase-11 as shown in Fig. 6. 

A DNA sequence that encodes a phytase comprising an amino 
acid sequence which is at least 90.71%; or at least 91. 91.5, 
10 92, 92.5, 93, 93.5, 94, 94.5, 95, 95.5, 96, 96.5, 97, 97.5, 98, 
98.5, 99, 99.5% identical to the sequence of amino acids 1-482 
of consensus phytase-11 as shown in Fig. 6. 

(F) Phytases and corresponding DNA sequences related to A. 

15 f umigatus alpha -mutant 

A phytase that comprises an amino acid sequence which is 
at least 97.17%; or at least 97.5, 98, 98.5, 99, 99.5% identical 
to the sequence of amino acids 1-467 of A. fumigatus alpha- 
mutant (phytase) as shown in Fig. 9. 

20 A phytase that comprises an amino acid sequence that is at 

least 97.82%; or at least 98, 98.5, 99, 99.5% similar to the 
sequence of amino acids 1-467 of A. fumigatus alpha-mutant 
(phytase) as shown in Fig. 9. 

A phytase which is encoded by a DNA sequence which is at 

25 least 96.13%; or at least 96.5, 97, 97.5, 98, 98.5, 99, 99.5% 
identical to nucleotides 1-1401 of the DNA sequence of A. 
fumigatus ATCC 13073 alpha-mutant (phytase) as shown in Fig. 9. 

A DNA sequence which encodes a phytase comprising an amino 
acid sequence which is at least 97.17%; or at least 97.5, 98, 

30 98.5, 99, 99.5% identical to the sequence of amino acids 1-467 
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of A. fumigatus ATCC 13073 alpha-mutant (phytase) as shown in 
Fig. 9. 

A DNA sequence which encodes a phytase and which (i) is at 
least 96.13%; or 96.5, 97, 97.5, 98, 98.5, 99, 99.5% identical; 
5 or (ii) hybridizes under low, or medium, medium/high, high, or 
very high stringency conditions to nucleotides 1-1401 of the DNA 
sequence of A. fumigatus ATCC 13 073 alpha-mutant (phytase) shown 
in Fig. 9. A suitable negative control is DNA encoding A. 
fumigatus 13 073 phytase. A suitable positive control is DNA 
10 encoding any one of the A. fumigatus ATCC 13073 alpha mutant 
phytase or the optimised alpha-mutant. 

(G) Phytases and corresponding DNA sequences related to the 

optimized A. fumigatus alpha-mutant 
15 A phytase that comprises an amino acid sequence that is at 

least 96.08%; or at least 96.5, 97, 97.5, 98, 98.5, 99, 99.5% 

identical to the sequence of the phytase of the optimized A. 

fumigatus alpha-mutant . 

A phytase that comprises an amino acid sequence that is at 
20 least 96.74%; or at least 97, 97.5, 98, 98.5, 99, 99.5% similar 

to the sequence of the phytase of the optimized A. fumigatus 

alpha-mutant . 

A phytase which is encoded by a DNA sequence which is at 
least 95.63%; or at least 96, 96.5, 97, 97.5, 98, 98.5, 99, 
25 99.5% identical to nucleotides 1-1401 of the DNA sequence 
encoding the optimized A. fumigatus alpha-mutant phytase. 

A DNA sequence that encodes a phytase comprising an amino 
acid sequence that is at least 96.08%; or at least 96.5, 97, 
97.5, 98, 98.5, 99, 99.5% identical to the optimized A. 
30 fumigatus alpha-mutant phytase. 
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A DNA sequence which encodes a phytase and which (i) is at 
least 95.63%; or at least 96, 96.5, 97, 97.5, 98, 98.5, 99, 
99.5% identical; or (ii) hybridizes under low, or medium, 
medium/high, high, very high stringency conditions to 
5 nucleotides 1-1401 of the DNA sequence encoding the optimized A. 
fumigatus alpha-mutant phytase. 

A suitable negative control is DNA encoding A. fumigatus 
ATCC 13073 phytase. A suitable positive control is DNA encoding 
any one of the A. fumigatus ATCC 13 0 73 alpha mutant phytase of 
10 the optimised alpha-mutant. 

(H) Phytases and corresponding DNA sequences related to 
consensus phytase- 7 

A phytase that comprises an amino acid sequence which is 
15 at least 94.87%; or at least 95, 95.5, 96, 96.5, 97, 97.5, 98, 
98 .5, 99, 99 . 5% identical to the sequence of amino acids 1-467 
of consensus phytase -7 as shown in Fig. 10. 

A phytase that comprises an amino acid sequence which is 
at least 95.30%; or at least 95.5, 96, 96.5, 97, 97.5, 98, 98.5, 
20 99, 99.5% similar to the sequence of amino acids 1-467 of 
consensus phytase-7 as shown in Fig. 10. 

A phytase which is encoded by a DNA sequence which is at 
least 96. 38%; or 96.5, 97, 97. 5, 98, 98.5, 99, 99.5% identical 
to nucleotides 12-1412 of the DNA sequence of consensus phytase - 
25 7 shown in Fig. 10. 

A DNA sequence which encodes a phytase and which (i) is at 
least 96.38%; or at least 96.5, 97, 97.5, 98, 98.5, 99, 99.5% 
identical; or (ii) hybridizes under low, or medium, medium/high, 
high, or very high stringency conditions to nucleotides 12-1412 
30 of the DNA sequence of consensus phytase-7 as shown in Fig. 10. 
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A DNA sequence which encodes a phytase comprising an amino 
acid sequence which is at least 94.87%; or at least 95, 95.5, 
96, 96.5, 97, 97.5, 98, 98.5, 99, 99.5% identical to the 
sequence of amino acids 1-4 67 of consensus phytase- 7 as shown in 
5 Fig. 10. 

(I) Phytases related to basidio consensus phytase 

A phytase which comprises an amino acid sequence which is 
at least 76.23%; or at least 77, 78, 79, 80, 81, 82, 83, 84, 85, 
86, 87, 88, 89, 90, 91, 92, 93, 94, 94.5, 95, 95.5, 96, 96.5, 

10 97, 97.5, 98, 98.5, 99, 99.5% identical to the combined sequence 
of (i) amino acids 1-441 of basidio consensus phytase shown in 
Fig. 3, and (ii) amino acids 1-26 shown in Fig. 5 (the sequence 
of (ii) to be added at the N-terminus of the sequence of (i) ) . 

A phytase that comprises an amino acid sequence which is 

15 at least 79.50%; or at least 80, 81, 82, 83, 84, 85, 86, 87, 88, 
89, 90, 91, 92, 93, 94, 95, 95.5, 96, 96.5, 97, 97.5, 98, 98.5, 
99, 99.5% similar to the sequence of amino acids 1-441 of 
basidio consensus phytase as shown in Fig. 3. 
(J) Phytases related to consensus phytase-12 

20 A phytase which comprises an amino acid sequence which is 

at least 70, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 

88, 89, 90, 91, 92, 93, 94, 95, 95.5, 96, 96.5, 97, 97.5, 98, 
98.5, 99, 99.5% identical to the sequence of amino acids 1-467 
of consensus phytase-12 as shown in Fig. 21. 

25 A phytase which comprises an amino acid sequence which is 

at least 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 

89, 90, 91, 92, 93, 94, 95, 95.5, 96, 96.5, 97, 97.5, 98, 98.5, 
99, 99.5% similar to the sequence of amino acids 1-467 of 
consensus phytase-12 as shown in Fig. 21. 
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What is claimed is: 

1. A phytase that comprises an amino acid sequence which is at 
least 93.80% identical to the sequence of amino acids 1-467 of 

5 consensus phytase- 10 (SEQ ID NO: 26) . 

2. A phytase that is encoded by a DNA sequence that is at least 
95.88% identical to nucleotides 12-1412 of the DNA sequence of 
consensus phytase-10 (SEQ ID NO: 25) . 

10 

3 . A phytase that comprises 

an amino acid sequence chosen from 

(i) SEQ ID NO: 26, or amino acids 1-43 8 thereof; or 
an amino acid sequence encoded by 
15 (ii) nucleotides 12-1412, or 90-1412 of SEQ ID NO: 25. 

4 . A phytase that comprises 

an amino acid sequence chosen from 

(i) consensus phytase- 10 -thermo [3] , 
20 ( i:L > variants of (i) , further including the mutations 

Q50T, K91A, or (Q50T+K91A) , the latter variant being 
shown in Fig. 8, 
(iii) amino acids 27-467 of any of the sequences of (i) 
and (ii) , 

25 < iv > SE Q ID NO: 31, or amino acids 1-441 thereof; or 

an amino acid sequence encoded by 

(v) nucleotides 1-1401, or 79-1401 of SEQ ID NO: 30. 

5 . A phytase that comprises 

3 0 an amino acid sequence chosen from 

(i) consensus phytase- 1 -thermo [8] , 
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(ii) variants of (i) , further including the mutations 
Q50T, K91A, or (Q50T+K91A) , the latter variant being 
shown in Fig. 7, 

(iii) amino acids 27-467 of any of the sequences of (i) 
5 and (ii) , or 

(iv) SEQ ID NO: 29, or amino acids 1-441 thereof; or 
an amino acid sequence encoded by 

(v) nucleotides 1-1407, or 79-1407 of SEQ ID NO: 28. 

io 6. A phytase that comprises the amino acid sequence of 
consensus phytase-11 (SEQ ID NO: 27) . 

7. A DNA sequence that comprises a DNA-sequence encoding a 
phytase of claim 1 . 



15 



8. A DNA sequence that comprises a DNA-sequence encoding a 
phytase, wherein the phytase -encoding DNA-sequence is (i) at 
least 95.88% identical, or (ii) hybridises under high stringency 
conditions, to nucleotides 12-1412 of the DNA sequence of 

20 consensus phytase-10 (SEQ ID NO: 25) . 

9. A DNA sequence that comprises a DNA-sequence that encodes a 
phytase, wherein the phytase comprises an amino acid sequence 
that is at least 93.80% identical to the sequence of amino acids 

25 1-467 of consensus phytase 10 (SEQ ID NO: 26) . 

10. A DNA sequence that comprises a DNA-sequence that encodes a 
phytase, and wherein the phytase -encoding DNA-sequence comprises 

(i) nucleotides 12-1412, or 90-1412 of the DNA 

30 sequence of consensus phytase-10 (SEQ ID NO: 

25) ; 
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(ii) nucleotides 1-1401, or 79-1401 of the DNA 
sequence of consensus phytase- 10 -thermo [3] - 
Q50T-K91A (SEQ ID NO: 30); or 

(iii) nucleotides 1-1401, or 79-1401 of the DNA 
sequence of consensus phytase- 1 -thermo [8] - 
Q50T-K91A (SEQ ID NO: 28) . 



11. A vector comprising a DNA sequence according to claim 7. 

10 12. A microbial host cell comprising a DNA sequence according 
to claim 7 , or the vector according to claim 11. 

13. A process for producing a phytase, the process comprising 
culturing the host cell according to claim 12 under conditions 
is permitting the production of the phytase, and recovering the 
phytase from the culture broth. 



14. A food, feed or pharmaceutical composition comprising a 
phytase of claim 1 . 

20 
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ABSTRACT 

This invention relates to improved phytases, preferably 
phytases of an increased thermostability, and a process of 
producing them. In particular, stabilizing amino acid mutations 
5 are introduced into a homologous protein, or the active site of 
a phytase is replaced in part or in toto. The corresponding DNA 
sequences and methods of preparing them are also disclosed, as 
are methods of producing the improved phytases, and the use 
thereof. Specific variants of Aspergillus fumigatus phytase and 
10 of consensus phytases are disclosed. 
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A. terreus 9A-1 KhsDCNSVDh 

A. terreus cbs NhsDCTSVDr 
A. niger var. awamori NqsTCDTVDQ 

A. niger T213 NqsSCDTVDQ 

A. niger NRRL3135 NqsSCDTVDQ 

A. fumigratus 13 073 GSkSCDTVDl 

A. fumigatus 32722 GSkSCDTVDl 

A. fuznigratus 58128 GSkSCDTVDl 

A. fumigatus 26906 GSkSCDTVDl 

A. fumigatus 32239 GSkACDTVEl 

S. nidulans QNHSCNTADG 

T. thermophilus DSHSCNTVEG 

M. thermophila ESRPCDTpDl 



GYQCFPELSH 
GYQCFPELSH 
GYQCFSETSH 
GYQCFSETSH 
GYQCFSETSH 
GYQCsPATSH 
GYQCsPATSH 
GYQCsPATSH 
GYQCsPATSH 
GYQCsPGTSH 
GYQCFPNVSH 
GYQCrPEISH 
GFQCgTAISH 



kWGlYAPYFS 
kWGlYAPYFS 
LWGQYAPFFS 
LWGQYAPFFS 
LWGQYAPFFS 
LWGQYSPFFS 
LWGQYSPFFS 
LWGQYSPFFS 
LWGQYSPFFS 
LWGQYSPFFS 
VWGQYSPYFS 
sWGQYSPFFS 
FWGQYSPYFS 



LQDESPFP1D 
LQDESPFP1D 
LANESAISPD 
LANESVISPD 
LANESVISPE 
LEDE1SVSSK 
LEDE1SVSSK 
LEDE1SVSSK 
LEDE1SVSSK 
LEDE1SVSSD 
IEQESAISeD 
LADQSEISPD 
VpSElDaS . . 
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VPEDChlTFV 
VPDDChlTFV 
VPAGCrVTFA 
VPAGCrVTFA 
VPAGCrVTFA 
LPKDCrlTLV 
LPKDCrlTLV 
LPKDCrlTLV 
LPKDCrlTLV 
LPKDCrVTFV 
VPHGCeVTFV 
VPQNCklTFV 
IPDDCeVTFA 



Consensus 
Consensus phytase 



NSHS CDTVDG GYQCFPEISH LWGQYSPYFS LEDESAISPD VPDDC-VTFV 
NSHSCDTVDG GYQCFPEISH LWGQYSPYFS LEDESAISPD VPDDCRVTFV 



A. terreus 9A-1 

A. terreus cbs 

A. niger var. awamori 

A. niger T213 

A. niger NRRL3135 

A. fumigatus 13 073 

A. fumigatus 32722 

A. fumigatus 5812 8 

A. fumigatus 26906 

A. fumigatus 3223 9 

E. nidulans 

T. thermophilus 

M. thermophila 

Consensus 
Consensus phytase 
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QVLARHGARs 
QVLARHGARs 
QVLSRHGARY 
QVLSRHGARY 
QVLSRHGARY 
QVLSRHGARY 
QVLSRHGARY 
QVLSRHGARY 
QVLSRHGARY 
QVLSRHGARY 
QVLSRHGARY 
QLLSRHGARY 
QVLSRHGARa 



PThSKtKAYA 
PTDSKtKAYA 
PTESKgKkYS 
PTESKgKkYS 
PTDSKgKkYS 
PTSSKsKkYK 
PTSSKsKkYK 
PTSSKsKkYK 
PTSSKsKkYK 
PTASKsKkYK 
PTES Ks KAYS 
PTSSKtElYS 
PTIKRaaSYv 



AtlAAIQKSA 
AtlAAIQKNA 
ALIEEIQQNV 
ALIEEIQQNV 
ALIEEIQQNA 
kLVTAIQaNA 
kLVTAIQaNA 
kLVTAIQaNA 
kLVTAIQaNA 
kLVTAIQKNA 
GLIEAIQKNA 
QLISrIQKTA 
DLIDrlHhGA 



TaFpGKYAFL 
TaLpGKYAFL 
TtFDGKYAFL 
TtFDGKYAFL 
TtFDGKYAFL 
TdFKGKFAFL 
TdFKGKFAFL 
TdFKGKFAFL 
TdFKGKFAFL 
TeFKGKFAFL 
TsFwGQYAFL 
TaYKGyYAFL 
I s YgPgYEFL 



100 

QSYNYSLDSE 
KSYNYSMGSE 
KTYNYSLGAD 
KTYNYSLGAD 
KTYNYSLGAD 
KTYNYTLGAD 
KTYNYTLGAD 
KTYNYTLGAD 
KTYNYTLGAD 
ETYNYTLGAD 
ESYNYTLGAD 
KDYrYqLGAN 
RTYDYTLGAD 



QVLSRHGARY PTSSK-KAYS ALIEAIQKNA T-FKGKYAFL KTYNYTLGAD 
QVLSRHGARY PTSSKSKAYS ALIEAIQKNA TAFKGKYAFL KTYNYTLGAD 



A. terreus 9A-1 

A. terreus cbs 

A. niger var. awamori 

A. niger T213 

A. niger NRRL3135 

A. fumigatus 13073 

A. fuinigratus 32722 

A. fumigatus 58128 

A. fumigatus 26906 

A. fumigatus 3223 9 

E. nidulans 

T. thermophilus 

M. thermophila 



101 

ELTPFGrNQL 
NLTPFGrNQL 
DLTPFGEQEL 
DLTPFGEQEL 
DLTPFGEQEL 
DLTPFGEQQL 
DLTPFGEQQL 
DLTPFGEQQL 
DLTAFGEQQL 
DLTPFGEQQM 
DLTiFGENQM 
DLTPFGENQM 
ELTRtGQQQM 



rDlGaQFYeR 
qDlGaQFYRR 
VNSGIKFYQR 
VNSGIKFYQR 
VNSGIKFYQR 
VNSGIKFYQR 
VNSGIKFYQR 
VNSGIKFYQR 
VNSGIKFYQR 
VNSGIKFYQK 
VDSGaKFYRR 
IQlGIKFYnH 
VNSGIKFYRR 



YNALTRh I nP 
YDTLTRhlnP 
YESLTRNIIP 
YESLTRNIIP 
YESLTRNIVP 
YKALARSWP 
YKALARSWP 
YKALARSWP 
YKALARSWP 
YKALAgSWP 
YKNLARKnTP 
YKSLARNaVP 
YRALARKs I P 



FVRATD AS R V 
FVRAADSSRV 
FIRSSGSSRV 
FIRSSGSSRV 
FIRSSGSSRV 
FIRASGSDRV 
FIRASGSDRV 
FIRASGSDRV 
FIRASGSDRV 
FIRSSGSDRV 
FIRASGSDRV 
FVRCSGSDRV 
FVRTAGqDRV 



150 

hESAEKFVEG 
hESAEKFVEG 
IASGEKFIEG 
IASGEKFIEG 
IASGKKFIEG 
IASGEKFIEG 
IASGEKFIEG 
IASGEKFIEG 
IASGEKFIEG 
IASGEKFIEG 
VASAEKFING 
IASGrlFIEG 
VhSAENFTQG 



Consensus 
Consensus phytase 



DLTPFGENQM VNSGIKFYRR YKALARK-VP FVRASGSDRV IASAEKFIEG 
DLTPFGENQM VNSGIKFYRR YKALARKIVP FIRASGSDRV IASAEKFIEG 
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151 200 

A. terreus 9A-1 FQTARqDDHh ANpHQPSPrV DValPEGSAY NNTLEHS1CT AFES . . . STV 

A. terreus cbs FQNARqGDPh ANpHQPSPrV DWIPEGTAY NNTLEHSICT AFEA. . . STV 
A. niger var. awamori FQSTKLkDPr AqpgQSSPkl DWISEASSs NNTLDPGTCT VFED...SEL 

A. niger T213 FQSTKLkDPr AqpgQSSPkl DWISEASSs NNTLDPGTCT VFED. . .SEL 

A. niger NRRL3135 FQSTKLkDPr AqpgQSSPkl DWISEASSs NNTLDPGTCT VFED...SEL 

A. fumigatus X3073 FQqAKLADPG A . TNRAAPAI SVIIPESETF NNTLDHGVCT kFEA . . . SQL 

A. fumigatus 32722 FQqAKLADPG A. TNRAAPAI SVIIPESETF NNTLDHGVCT kFEA... SQL 

A. fumigatus 58128 FQqAKLADPG A . TNRAAPAI SVIIPESETF NNTLDHGVCT kFEA. ..SQL 

A. fumigatus 26906 FQqAKLADPG A. TNRAAPAI SVIIPESETF NNTLDHGVCT kFEA... SQL 

A. fumigatus 32239 FQqANVADPG A . TNRAAPVI SVIIPESETY NNTLDHS VCT NFEA...SEL 

E. nidulans FRKAQLhDHG S. .gQATPW NVIIPEiDGF NNTLDHSTCV SFEN. . . DEr 

T. thermophilus FQSAKVlDPh SDkHDAPPTI NVIIeEGPSY NNTLDtGSCP VFED...SSg 

M. thermophila FHSA1LADRG STvRPTlPyd itlWI PETAGa NNTLHND1CT AFEEgpySTI 

Consensus FQSAKLADPG S-PHQASPVI NVIIPEGSGY NNTLDHGTCT AFED SEL 

Consensus phytase FQSAKLADPG SQPHQASPVI DVIIPEGSGY NNTLDHGTCT AFED... SEL 

201 250 

A. terreus 9A-1 GDDAvANFTA VFAPAIaQRL EADLPGVqLS TDDWnLMAM CPFETVS1TD 

A. terreus cbs GDAAADNFTA VFAPAIakRL EADLPGVqLS ADDWnLMAM CPFETVS1TD 

A. niger var. awamori ADTVEANFTA TFAPSIRQRL ENDLSGVTLT DTEVTyLMDM CSFDTIStST 

A. niger T213 ADTVEANFTA TFAPSIRQRL ENDLSGVTLT DTEVTyLMDM CSFDTIStST 

A. niger NRRL3135 ADTVEANFTA TFVPSIRQRL ENDLSGVTLT DTEVTyLMDM CSFDTIStST 

A. fumigatus 13 073 GDEVAANFTA lFAPDIRARa EkHLPGVTLT DEDWsLMDM CSFDTVARTS 

A. fumigatus 32722 GDEVAANFTA lFAPDIRARa EkHLPGVTLT DEDWsLMDM CSFDTVARTS 

A. fumigatus 5812 8 GDEVAANFTA lFAPDIRARa EkHLPGVTLT DEDWsLMDM CSFDTVARTS 

A. fumigatus 26906 GDEVAANFTA lFAPDIRARa KkHLPGVTLT DEDWsLMDM CSFDTVARTS 

A. fumigatus 32239 GDEVEANFTA 1FAPAIRARI EkHLPGVqLT DDDWsLMDM CS FDTVARTA 

E. nidulans ADE i EANFTA IMGPPIRkRL ENDLPGIKLT NENVI yLMDM CS FDTMARTA 

T. thermophilus GHDAQE KFAk qFAPAIlEKI KDHLPGVDLA vSDVpyLMDL CPFETLARNh 

M. thermophila GDDAQDTY1S TFAGPItARV NANLPGANLT DADTVa LMDL CPFETVAsSS 

Consensus GDDAEANFTA TFAPAIRARL EADLPGVTLT DEDW-LMDM CPFETVARTS 

Consensus phytase GDDVEANFTA LFAPAIRARL EADLPGVTLT DEDWYLMDM CPFETVARTS 



A. terreus 9A-1 DAhTLSPFC DLFTAtEWtq YNYL1SLDKY YGYGGGNPLG 

A. terreus cbs DAhTLSPFC DLFTAaEWtq YNYL1SLDKY YGYGGGNPLG 

A. niger var. awamori vDTKLSPFC DLFTHdEWih YDYLQSLkKY YGHGAGNPLG 

A. niger T213 vDTKLSPFC DLFTHdEWih YDYLRSLkKY YGHGAGNPLG 

A. niger NRRL3135 vDTKLSPFC DLFTHdEWin YDYLQSLkKY YGHGAGNPLG 

A. fumigatus 13 073 DASQLSPFC QLFTHnEWkk YNYLQSLGKY YGYGAGNPLG 

A. fumigatus 32722 DASQLSPFC QLFTHnEWkk YNYLQSLGKY YGYGAGNPLG 

A. fumigatus 58128 DASQLSPFC QLFTHnEWkk YNYLQSLGKY YGYGAGNPLG 

A. fumigatus 26906 DASQLSPFC QLFTHnEWkk YNYLQSLGKY YGYGAGNPLG 

A. fumigatus 32239 DASELSPFC AIFTHnEWkk YDYLQSLGKY YGYGAGNPLG 

E. nidulans HGTELSPFC AIFTEkEWlq YDYLQSLSKY YGYGAGSPLG 

T. thermophilus TDT.LSPFC ALsTQeEWqa YDYYQSLGKY YGnGGGNPLG 

M. thermophila sdpatadagg gNGrpLSPFC rLFSEsEWra YDYLQSVGKW YGYGPGNPLG 

Consensus -DATELSPFC ALFTE-EW- - YDYLQSLGKY YGYGAGNPLG 

Consensus phytase DATELSPFC ALFTHDEWRQ YDYLQSLGKY YGYGAGNPLG 
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301 

A. terreus 9A-1 PVQGVGWaNE 

A. terreus cbs PVQGVGWaNE 
A. niger var. awamori PTQGVGYaNE 

A. nlger T213 PTQGVGYaNE 

A. niger NRRL313 5 PTQGVGYaNE 

A. fumigatus 13 073 PAQGIGFtNE 

A. fumigatus 32722 PAQGIGFtNE 

A. fumigatus 58128 PAQGIGFtNE 

A. fumigatus 26906 PAQGIGFtNE 

A. fumigatus 32239 PAQGIGFtNE 

S. nidulans PAQGIGFtNE 

T. thermophilus PAQGVGFvNE 

thermophi I a PTQGVGFvNE 



LMARLTRAPV 
LIARLTRSPV 
LIARLTHSPV 
LIARLTHSPV 
LIARLTHSPV 
LIARLTRSPV 
LIARLTRSPV 
LIARLTRSPV 
LIARLTRSPV 
LIARLTNSPV 
LIARLTQSPV 
LIARMTHSPV 
LLARLAgvPV 



HDHTCVNNTL 
HDHTCVNNTL 
HDDTSSNHTL 
HDDTSSNHTL 
HDDTSSNHTL 
QDHTSTNsTL 
QDHTSTNsTL 
QDHTSTNsTL 
QDHTSTNsTL 
QDHTSTNsTL 
QDNTSTNHTL 
QDYTTVNHTL 
RDgTSTNRTL 



DASPATFPLN 
DAMP AT F P LN 
DSNPATFPLN 
DSNPATFPLN 
DSSPATFPLN 
vSNPATFPLN 
vSNPATFPLN 
vSNPATFPLN 
vSNPATFPLN 
DSDPATFPLN 
DSNPATFPLD 
DSNPATFPLN 
DGDPrTFPLG 



350 

ATLYADFSHD 
ATLYADFSHD 
STLYADFSHD 
STLYADFSHD 
STLYADFSHD 
ATMYVDFSHD 
ATMYVDFSHD 
ATMYVDFSHD 
ATMYVDFSHD 
ATIYVDFSHD 
rKLYADFSHD 
ATLYADFSHD 
rPLYADFSHD 



Consensus 
Consensus phytase 



PAQGVGF-NE LIARLTHSPV QDHTSTNHTL DSNPATFPLN ATLYADFSHD 
P AQGVG FANE LIARLTRSPV QDHTSTNHTL DSNPATFPLN ATLYADFSHD 



A. terreus 9A-1 

A. terreus cbs 

A. niger var. awamori 

A. niger T213 

A. niger NRRL3135 

A. fumigatus 13 073 

A. fumigatus 32722 

A. fumigatus 5 8128 

A. fumigatus 26906 

A. fumigatus 32239 

E. nidulans 

T. thermophilus 

M . thermophi 1 a 



351 

SNLVSIFWAL 
SNLVSIFWAL 
NGIISILFAL 
NGIISILFAL 
NGIISILFAL 
NSMVSIFFAL 
NSMVS I FFAL 
NSMVSIFFAL 
NSMVSIFFAL 
NGMIPIFFAM 
NSMISIFFAM 
NTMTS I FaAL 
NDMMGVLgAL 



GLYNGTAPLS 
GLYNGTkPLS 
GLYNGTkPLS 
GLYNGTkPLS 
GLYNGTkPLS 
GLYNGTEPLS 
GLYNGTGPLS 
GLYNGTEPLS 
GLYNGTEPLS 
GLYNGTEPLS 
GLYNGTQPLS 
GLYNGTAkLS 
GaYDGVPPLD 



qTSVESVSQT 
qTTVEDITrT 
TTTVENI TQT 
TTTVENITQT 
TTTVENI TQT 
rTSVESaKEl 
rTSVESaKEl 
rTSVESaKEl 
rTSVESaKEl 
qTSeESTKES 
mDSVESIQEm 
TTEIKSIEET 
KTArrDpEEl 



DGYAAAWTVP 
DGYAAAWTVP 
DGFSSAWTVP 
DGFSSAWTVP 
DGFSSAWTVP 
DGYSASWWP 
DGYSASWWP 
DGYSASWWP 
DGYSASWWP 
NGYSASWAVP 
DGYAASWTVP 
DGYSAAWTVP 
GGYAAS WAVP 



400 

FAARAYVEMM 
FAARAYIEMM 
FASR1YVEMM 
FASR1YVEMM 
FASR1YVEMM 
FGARAYFEtM 
FGARAYFEtM 
FGARAYFEtM 
FGARAYFEtM 
FGARAYFEtM 
FGARAYFELM 
FGGRAYI EMM 
FAARiYVEKM 



Consensus 
Consensus phytase 



NSMISIFFAL GLYNGTAPLS TTSVESIEET DGYAASWTVP FGARAYVEMM 
NSMISIFFAL GLYNGTAPLS TTSVESIEET DGYSASWTVP FGARAYVEMM 



401 450 

A. terreus 9A-1 QC RAEKE PLVRVLVNDR VMPLHGCPTD KLGRCKrDAF 

A. terreus cbs QC RAEKQ PLVRVLVNDR VMPLHGCAVD NLGRCKrDDF 

A. ni^rer var. awamori QC QAEQE PLVRVLVNDR WPLHGCPID aLGRCTrDSF 

A. niger T213 QC QAEQE PLVRVLVNDR WPLHGCPID aLGRCTrDSF 

A. nigrer NRRL313 5 QC QAEQE PLVRVLVNDR WPLHGCPVD aLGRCTrDSF 

A. fumigatus 13073 QC KSEKE PLVRALINDR WPLHGCDVD KLGRCKLNDF 

A. fumigatus 32722 QC KSEKE PLVRALINDR WPLHGCDVD KLGRCKLNDF 

A. fumigatus 5812 8 QC KSEKE SLVRALINDR WPLHGCDVD KLGRCKLNDF 

A. fumigatus 26906 QC KSEKE PLVRALINDR WPLHGCDVD KLGRCKLNDF 

A. fumigatus 32239 QC KSEKE PLVRALINDR WPLHGCAVD KLGRCKLKDF 

E. nidulans QC E.KKE PLVRVLVNDR WPLHGCAVD KFGRCTLDDW 

T. thermophilus QC DDSDE PWRVLVNDR WPLHGCEVD SLGRCKrDDF 

M. thermophi la RCsggggggg ggegrQEKDE eMVRVLVNDR VMTLkGCGAD ErGMCTLErF 

Consensus QC QAEKE PLVRVLVNDR WPLHGCAVD KLGRCKLDDF 

Consensus phytase QC QAEKE PLVRVLVNDR WPLHGCAVD KLGRCKRDDF 
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A. terreus 9A-1 
A. terreus cbs 
A. niger var. awamorl 
A, niger T213 
A. niger NRRL313 5 
A. fumigatus 13 073 
A. fumigatus 32722 
A. fumigatus 58128 
A. fumigatus 26906 
A. fumigatus 32239 
E. nidulans 
T. thermophilus 
M . thermophi 1 a 

Consensus 
Consensus phytase 



451 471 

VAGLSFAQAG GNWADCF ~ 

VEGLSFARAG NWAECF 

VrGLSFARSG GDWAECsA — - 
VrGLSFARSG GDWAECFA — - 
VrGLSFARSG DWAECFA — 
VKGLSWARSG GNWGECFS — ~ 
VKGLSWARSG GNWGECFS — - 
VKGLSWARSG GNWGECFS — - 
VKGLSWARSG GNWGECFS — - 
VKGLSWARSG NSEQSFS— 
VEGLNFARSG GNWkTCFTl- ~ 
VrGLSFARqG GNWEGCYAas e 
I E SMAFARGN GKWDlCFA — ~ 

VEGLSFARSG GNWAECFA-- - 
VEGLSFARSG GNWAECFA . . . 
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CP-1 

ECO RIMGVFVVLLS IATLFGST 

TATATGAarrCATGGGCGTGTTCGTCGTGCTACTGTCCATTGCCACCTTGTTCGGTTCCA 

1 + + + + + + 60 

ATATACTTAAGTACCCGCACAAGCAGCACGATGACAGGTAACGGTGGAACAAGCCAAGGT 



SGTALGPRGNSHSCDTVDGG 
CATCCGGTACCGGCTTGGGTCGTCGTGGTAATTCTCACTCTTGTGACACTGTTGACGGTG 

61 + + + + + + 120 

GTAGGCCATGGCGGAACCCAGGAGCACCAXTAAGAGTGAGAACACTGTGACAACTGCCAC 

CP-2 

CP-3 

YQCFPE ISHLWGQYSPYFSL 
GTTACCAATGTTTCCCAGAAATTTCTCACTTGTGGGGTCAATACTCTCCATACTTGTCTT 

121 + + + + + + 180 

CAATGGTTACAAAGGGTCTTTAAAGAGTGAACACCCCAGTTATGAGAGGTATGAAGAGAA 



EDESAISPDVPDDCRVTFVQ 
TGGAAGACGAATCTGCTATTTCTCCAGACGTTCCAGACGACTGTAGAGTTACTTTCGTTC 

181 + + + + + + 240 

ACCTTCTGCTTAGACGATAAAGAGGTCTGCAAGGTCTGCTGACATCTCAATGAAAGCAAG 

CP-4 

CP-5 

VLSRHGARYPTSSKSKAYSA 
AAGTTTTGTCTAGACACGGTGCTAGATACCCAACTTCTTCTAAGTCTAAGGCTTACTCTG 

241 + + + + + + 300 

TTCAAAACAGATCTGTGCCACGATCTATGGGTTGAAGAAGATTCAGATTCCGAATGAGAC 



LIEAIQKNATAFKGKYAFLK 
CTTTGATTGAAGCTATTCAAAAGAACGCTACTGCTTTCAAGGGTAAGTACGCTTTCTTGA 

301 + + + + + + 360 

GAAACTAACTTCGATAAGTTTTCTTGCGATGACGAAAGTTCCCATTCATGCGAAAGAACT 

CP-6 

CP-7 

TYNYTLGADDLTPFGENQMV 
AGACTTACAACTACACTTTGGGTGCTGACGACTTGACTCCATTCGGTGAAAACCAAATGG 

361 + + + + + + 420 

TCTGAATGTTGATGTGAAACCCACGACTGCTGAACTGAGGTAAGCCACTTTTGGTTTACC 

NSGIKFYRRYKALARKIVPF 
TTAACTCTGGTATTAAGTTCTACAGAAGATACAAGGCTTTGGCTAGAAAGATTGTTCCAT 

421 + + + + + + 480 

AATTGAGACCATAATTCAAGATGTCTTCTATGTTCCGAAACCGATCTTTCTAACAAGGTA 

CP-8 

CP-9 

IRASGSDRVIASAEKFIEGF 

TCATTAGAGCTTCTGGTTCTGACAGAGTTATTGCTTCTGCTGAAAAGTTCATTGAAGGTT 
481 + + + + + + 54Q 

AGTAATCTCGAAGACCAAGACTGTCTCAATAACGAAGACGACTTTTCAAGTAACTTCCAA 

QSAKLADPGSQPHQAS PVID 
TCCAATCTGCTAAGTTGGCTGACCCAGGTTCTCAACCACACCAAGCTTCTCCAGTTATTG 

541 + + + + + + 600 

AGGTTAGACGATTCAACCGACTGGGTCCAAGAGTTGGTGTGGTTCGAAGAGGTCAATAAC 
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CP-10 

CP-11 

VIIPEGSGYNNTLDHGTCTA 
ACGTTATTATTCCAGAAGGaTCcGGTTACAACAACACTTTGGACCACGGTACTTGTACTG 
60 l + + + + + + 660 

TGCAATAATAAGGTCTTCCtAGgCCAATGTTGTTGTGAAACCTGGTGCCATGAACATGAC 

FEDSELGDDVEANFTALFAP 
CTTTCGAAGACTCTGAATTGGGTGACGACGTTGAAGCTAACTTCACTGCTTTGTTCGCTC 

661 + + + + + + 720 

GAAAGCTTCTGAGACTTAACCCACTGCTGCAACTTCGATTGAAGTGACGAAACAAGCGAG 

CP -12 

AIRARLEADLPGVTLTDEDV 
CAGCTATTAGAGCTAGATTGGAAGCTGACTTGCCAGGTGTTACTTTGACTGACGAAGACG 
721 + + + + + + 780 

GTCGATAATCTCGATCTAACCTTCGACTGAACGGTCCACAATGAAACTGACTGCTTCTGC 

CP-13 

VYLMDMCPFETVARTSDATE 
TTGTTTACTTGATGGACATGTGTCCATTCGAAACTGTTGCTAGAACTTCTGACGCTACTG 

781 + + + + + + 840 

AACAAATGAACTACCTGTACACAGGTAAGCTTTGACAACGATCTTGAAGACTGCGATGAC 

LSPFCALFTHDEWRQYDYLQ 
AATTGTCTCCATTCTGTGCTTTGTTCACTCACGACGAATGGAGACAATACGACTACTTGC 
841 + + + + + + 900 

TTAACAGAGGTAAGACACGAAACAAGTGAGTGCTGCTTACCTCTGTTATGCTGATGAACG 

CP-14 

CP-15 

SLGKYYGYGAGNPLGPAQGV 

AATCTTTGGGTAAGTACTACGGTTACGGTGCTGGTAACCCATTGGGTCCAGCTCAAGGTG 

901 + + + + + + 96Q 

TTAGAAACCCATTCATGATGCCAATGCCACGACCATTGGGTAACCCAGGTCGAGTTCCAC 

GFANELIARLTRSPVQDHTS 
TTGGTTTCGCTAACGAATTGATTGCTAGATTGACTAGATCTCCAGTTCAAGACCACACTT 

961 + + + + + + 1020 

AACCAAAGCGATTGCTTAACTAACGATCTAACTGATCTAGAGGTCAAGTTCTGGTGTGAA 

CP-16 

CP-17 

TNHTLDSNPATFPLNATLYA 

CTACTAACCACACTTTGGACTCTAACCCAGCTACTTTCCCATTGAACGCTACTTTGTACG 

1021 + + + + + + 1Q80 

GATGATTGGTGTGAAACCTGAGATTGGGTCGATGAAAGGGTAACTTGCGATGAAACATGC 

DFSHDNSMIS IFFALGLYNG 

CTGACTTCTCTCACGACAACTCTATGATTTCTATTTTCTTCGCTTTGGGTTTGTACAACG 

1081 + + + + + + 114Q 

GACTGAAGAGAGTGCTGTTGAGATACTAAAGATAAAAGAAGCGAAACCCAAACATGTTGC 

CP-18 

CP-19 

TAPLSTTSVESIEETDGYSA 

GTACTGCTCCATTGTCTACTACTTCTGTTGAATCTATTGAAGAAACTGACGGTTACTCTG 

1141 + + + + + + 120Q 

CATGACGAGGTAACAGATGATGAAGACAACTTAGATAACTTCTTTGACTGCCAATGAGAC 
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SWTVPFGARAYVEMMQCQAE 
CTTCTTGGACTGTTCCATTCGGTGCTAGAGCTTACGTTGAAATGATGCAATGTCAAGCTG 



1201 



+ 



+ 



+ 



+ 1260 



GAAGAACCTGACAAGGTAAGCCACGATCTCGAATGCAACTTTACTACGTTACAGTTCGAC 

CP-20 

CP-21 

KEPLVRVLVNDRVVPLHGCA 
AAAAGGAACCATTGGTTAGAGTTTTGGTTAACGACAGAGTTGTTCCATTGCACGGTTGTG 

1261 + + + + + + 1320 

TTTTCCTTGGTAACCAATCTCAAAACCAATTGCTGTCTCAACAAGGTAACGTGCCAACAC 

VDKLGRCKRDDFVEGLSFAR 
CTGTTGACAAGTTGGGTAGATGTAAGAGAGACGACTTCGTTGAAGGTTTGTCTTTCGCTA 

1321 + + + + + + 1380 

GACAACTGTTCAACCCATCTACATTCTCTCTGCTGAAGCAACTTCCAAACAGAAAGCGAT 

CP-22 

SGGNWAECFA* Eco RI 
GATCTGGTGGTAACTGGGCTGAATGTTTCGCTTAAGAATTCATATA 

1381 + + + + 1426 

CTAGACCACCATTGACCCGACTTACAAAGCGAATTCTTAAGTATAT 
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P. 
P. 
T. 
A. 
P. 



involutus 

involutus 

pubescens 

ped.ia.des 

lycil 



Basidio 



(phyAl) SvP.KnTAPt FPIPeseQrn WSPYSPYFPL AeYkAPPAGC 
(phyA2) SvP.RniAPK FSIPeseQrn WSPYSPYFPL AeYkAPPAGC 
hiPlRdTSAc LdVTrDvQqs WSmYSPYFPa AtYvAPPASC 
GgwQaTfvQ pfFPpQiQds WAAYTPYYPV qaYtPPPkDC 
StQf sfvAAQ LPIPaQntsn WGPYdPFFPV EpYaAPPEGC 



50 

QlnQVNIIQR 
ElnQVNIIQR 
QlnQVHIIQR 
KItQVNIIQR 
tVtQVNLIQR 



S-P-R-TAAQ LPIP-Q-Q- - WSPYSPYFPV A-Y-APPAGC QI-QVNIIQR 



P. involutus 

P. involutus 

T. pubescens 

A. pediades 

P. lycii 

Basidio 



51 

(phyAl) HGARFPTSGA TTRIKAGLTK LQGvqnf TDA KFNFIkSfkY 
{phyA2) HGARFPTSGA ATRIKAGLSK LQSvqnf TDP KFDFIkSf TY 
HGARFPTSGA AkR I QTAVAK LKAAsnyTDP lLAFVtNyTY 
HGARFPTSGA GTRIQAAVkK LQSAktyTDP RLDFLtNyTY 
HGARWPTSGA rSRqvAAVAK IQmArpf TDP KYEFLnDfvY 



100 

dLGnsDLVPF 
dLGtsDLVPF 
sLGqDsLVeL 
tLGhDDLVPF 
kFGvADLLPF 



HGARFPTSGA ATR I QAAVAK LQSA TDP KLDFL-N-TY -LG-DDLVPF 



P. involutus 

P. involutus 

T. pubescens 

A . pediades 

P. lycii 

Basidio 



101 

(phyAl) GAaQSfDAGQ 
(phyA2) GAaQSf DAG1 
GAtQSSEAGQ 
GA1QSSQAGE 
GAnQShQTGt 



EAFARYSkLV 
EvFARYSkLV 
EAFTRYSsLV 
ETFqRYS f LV 
DmYTRYStLf 



SkNNLPFIRA 
SsDNLPFIRS 
SaDELPFVRA 
SkENLPFVRA 
egGDVPFVRA 



dGSDRWDSA 
dGSDRWDTA 
SGSDRWATA 
SSSNRWDSA 
AGdQRWDSS 



150 

TNWTAGFAsA 
TNWTAGFAsA 
nNWTAGFAlA 
TNWTEGFSaA 
TNWTAGFGdA 



GA-QSSQAGQ EAFTRYS-LV S-DNLPFVRA SGSDRWDSA TNWTAGFA - A 



P. involutus 

P. involutus 

T. pubescens 

A. pediades 

P . lyci i 

Basidio 



151 

(phyAl) ShNTvqPkLn LILPQtGNDT LEDNMCPaAG DSDPQvNaWL 
(phyA2) SrNAiqPkLd LILPQtGNDT LEDNMCPaAG ESDPQvDaWL 
SsNSitPvLs VIISEaGNDT LDDNMCPaAG DSDPQvNqWL 
ShHvlnPiLf VILSEslNDT LDDaMCPnAG sSDPQtGiWt 
SgETvlPtLq WLqEeGNcT LcNNMCPnEv DGDest.tWL 



200 

AVa f PS I TAR 
Asaf PSVTAQ 
AqFAPPMTAR 
SIYGTPIAnR 
GVFAPnITAR 



S-NT--P-L- VILSE-GNDT LDDNMCP-AG DSDPQ-N-WL AVFAPPITAR 



201 

P. involutus (phyAl) LNAAAPSvNL TDtDAfNLvs LCAFlTVSkE 

P. involutus (phyA2) LNAAAPGANL TDaDAfNLvs LCPFmTVSkE 

T. pubescens LNAGAPGANL TDtDTyNLlt LCPFETVAtE 

A. pediades LNqqAPGANI TAaDvsNLip LCAFET I vkE 

P. lycii LNAAAPSANL SDsDAltLmd MCPFDTLSsG 



kkSdFCtLFE 
qkSdFCtLFE 
rrSeFCDIYE 
tpSpFCNLF. 
naSpFCDLF . 



250 

giPGsFeAFa 
giPGsFeAFa 
elQAE . dAFa 
. tPEEFaqFe 
. tAEEYvSYe 



Basidio 



LNAAAPGANL TD-DA-NL- - LCPFETVS-E --S-FCDLFE - -PEEF-AF- 



Fig. 3a 
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P. involutus 

P. involutus 

T. pubescens 

A. pediades 

P . lyci i 

Basidio 



251 300 

iphyAl) YgGDLDKFYG TGYGQeLGPV QGVGYVNELI ARLTnsAVRD NTQTNRTLDA 

(phyA2) YaGDLDKFYG TGYGQALGPV QGVGYINELL ARLTnsAVnD NTQTNRTLDA 

YnADLDKFYG TGYGQPLGPV QGVGYINELI ARLTaQnVsD HTQTNsTLDS 

YfGDLDKFYG TGYGQPLGPV QGVGYINELL ARLTemPVRD NTQTNRTLDS 

YyyDLDKYYG TGpGNALGPV QGVGYVNELL ARLTgQAVRD ETQTNRTLDS 

Y - GDLDKF YG TGYGQPLGPV QGVGYINELL ARLT - QAVRD NTQTNRTLDS 



P. involutus 

P. involutus 

T. pubescens 

A, pediades 

P. lycii 



iphyAl) 
(phyA2) 



301 

SPvTFPLNKT 
APdTFPLNKT 
SPeTFPLNRT 
SP1TFPLDRS 
dPaTFPLNRT 



FYADFSHDNl 
MYADFSHDN1 
LYADFSHDNQ 
IYADLSHDNQ 
FYADFSHDNt 



MVAVFSAMGL 
MVAVFSAMGL 
MVAIFSAMGL 
MIAIFSAMGL 
MVP I FAALGL 



FrQPAPLsTS 
FrQSAPLsTS 
FNQSAPLDPT 
FNQSSPLDPS 
FNaTA.LDPl 



350 

vPNPwRTWrT 
tPDPNRTWLT 
tPDPaRTFLv 
fPNPKRTWVT 
kPDeNRlWVd 



Basidio 



SP-TFPLNRT FYADF SHDNQ MVAIFSAMGL FNQSAPLDPS -PDPNRTWVT 



P. 
P. 
T. 
A . 
P. 



involutus 

involutus 

pubescens 

pediades 

lycii 



Basidio 



351 400 

(phyAl) SsLVPFSGRM WERLsC.f GT tkV RVLVQDqVQP 

(phyA2) SsWPFSARM aVERLsC . . a GT tkV RVLVQDqVQP 

kKIVPFSARM WERLdC . . g GA qsV RLLVNDAVQP 

SRLtPFSARM VtERLlCqrd GTgsggpsri mrngnvqtfV RILVNDALQP 
SKLVPFSGHM tVEKLaC sgkeaV RVLVNDAVQP 

SKLVPFSARM WERL-C GT V RVLVNDAVQP 



P. involutus iphyAl) 

P. involutus (phyA2) 

T. pubescens 

A. pediades 

P. lycii 

Basidio 



401 

LEFCGGDrNG ICTLAkFVES 
LEFCGGDqDG ICALDkFVES 
LAFCGADtsG vCTLDAFVES 
LKFCGGDmDS 1CTLEAFVES 
LEFCGG.vDG vCeLsAFVES 

LEFCGGD-DG -CTLDAFVES 



441 

QtFARsDGaG DFEKCFATSa ~ 
QaYARsGGaG DFEKCLATTv - 
QaYARNDGEG DFEKCFAT — - 

QkYAREDGQG DFEKCFD ~ 

QtYARENGQG DFAKCgfvPs e 

Q-YAREDGQG DFEKCFATP - - 



Fig. 3b 
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A. terreus 9al 

A. terreus cbs 

A. niger var. awamori 

A. niger NRRL3135 

A. fumigatus 13073 

A. fumigatus 32722 

A. fumigatus 58128 

A. fumigatus 26906 

A. fumigatus 32239 

E. nidulans 

T . thermophilus 

T. lanuginosus 

M. thermophila 

Basidio 



1 50 
KhsdCNSVDh GYQCf PELSH kWGlYAPYFS LqDESPFPlD VPeDCHITFV 
NhsdCtSVDr GYQCf PELSH kWGlYAPYFS LqDESPFPlD VPdDCHITFV 
NqsTCDTVDq GYQCf SEtSH LWGQYAPFFS LANESAISPD VPaGCRVTFa 
NqsSCDTVDq GYQCf SEtSH LWGQYAPFFS LANESvISPE VPaGCRVTFa 
GSkSCDTVDl GYQCsPAtSH LWGQYSPFFS LEDE1SVSSK LPkDCRI TLV 
GSkSCDTVDl GYQCsPAtSH LWGQYSPFFS LEDE1SVSSK LPkDCRI TLV 
GSkSCDTVDl GYQCsPAtSH LWGQYSPFFS LEDE1SVSSK LPkDCRI TLV 
GSkSCDTVDl GYQCsPAtSH LWGQYSPFFS LEDE1SVSSK LPkDCRI TLV 
GSkACDTVEl GYQCsPGtSH LWGQYSPFFS LEDE1SVSSD LPkDCRVTFV 
QNHSCNTaDG GYQCf PNVSH VWGQYSPYFS IEQESAISeD VPhGCeVTFV 
DSHSCNTVEG GYQCrPEISH sWGQYSPFFS LADQSEISPD VPqNCKITFV 

nvDIAR hWGQYSPFFS LAEvSEISPA VPkGCRVeFV 

ESRPCDTpDl GFQCgTAI SH FWGQYSPYFS VPsElDaS.. IPdDCeVTFa 
xSxPxrxtAA qLPipxQxqx xWSPYSPYFP VAxyxA. . . . pPaGCQIxqV 



Consensus NSHSCDTVDG GYQC-PEISH LWGQYSPFFS LADES AISPD VP-GCRVTFV 
FcplO NSHSCDTVDG GYQCFPEISH LWGQYSPFFS LADES AISPD VPKGCRVTFV 



51 100 

A. terreus 9al QVLARHGARs PThSKTKaYA AtlaAIQKSA TaFpGKYAFL QSYNYSLDSE 

A. terreus cbs QVLARHGARs PTdSKTKaYA AtlaAIQKNA TaLpGKYAFL KSYNYSMGSE 

A. niger var. awamori QVLSRHGARY PTeSKGKKYS ALIeEIQQNv T t FDGKYAFL KTYNYS LGAD 

A. niger NRRL3135 QVLSRHGARY PTdSKGKKYS ALIeEIQQNA T t FDGKYAFL KTYNYS LGAD 

A. fumigatus 13 073 QVLSRHGARY PTSSKSKKYk kLVtAIQaNA TdFKGKFAFL KTYNYTLGAD 

A. fumigatus 32722 QVLSRHGARY PTSSKSKKYk kLVtAIQaNA TdFKGKFAFL KTYNYTLGAD 

A. fumigatus 58128 QVLSRHGARY PTSSKSKKYk kLVtAIQaNA TdFKGKFAFL KTYNYTLGAD 

A. fumigatus 26906 QVLSRHGARY PTSSKSKKYk kLVtAIQaNA TdFKGKFAFL KTYNYTLGAD 

A. fumigatus 32239 QVLSRHGARY PTASKSKKYk kLVtAIQKNA TeFKGKFAFL ETYNYTLGAD 

E. nidulans QVLSRHGARY PTeSKSKaYS GLIeAIQKNA TsFwGQYAFL ESYNYTLGAD 

T. thermophilus QLLSRHGARY PTSSKTE1YS qLIsrIQKtA TaYKGyYAFL KdYrYqLGAN 

T. lanuginosus QVLSRHGARY PTAhKSEvYA ELLqrIQDtA TeFKGDFAFL RdYaYhLGAD 

M. thermophila QVLSRHGARa PTlkRAasYv DLIdrlHhGA isYgPgYEFL RTYDYTLGAD 

Basidio NIIqRHGARF PTSGaAtRiq AaVakLQsax xxtDPKLDFL xnxtYxLGxD 

Consensus QVLSRHGARY PTSSKSKKYS ALI-AIQKNA T-FKGKYAFL KTYNYTLGAD 

FcplO QVLSRHGARY PTSSKSKKYS ALIEAIQKNA TAFKGKYAFL KTYNYTLGAD 



101 150 

A. terreus 9al ELTPFGrNQL rDlGaQFYeR YNAL.TRhln P FVRATD As R VhESAEKFVE 

A. terreus cbs NLTPFGrNQL qDlGaQFYRR YDTL.TRhln PFVRAADSsR VhESAEKFVE 

A. niger var. awamori DLTPFGEQEL VNSGIKFYQR YESL . TRnll PFIRSSGSsR VIASGEKFIE 

A. niger NRRL3135 DLTPFGEQEL VNSGIKFYQR YESL . TRnlV PFIRSSGSsR VIASGKKFIE 

A. fumigatus 13073 DLTPFGEQQL VNSGIKFYQR YKAL.ARsW PFIRASGSDR VIASGEKFIE 

A. fumigatus 32722 DLTPFGEQQL VNSGIKFYQR YKAL.ARsW PFIRASGSDR VIASGEKFIE 

A. fumigatus 58128 DLTPFGEQQL VNSGIKFYQR YKAL.ARsW PFIRASGSDR VIASGEKFIE 

A. fumigatus 26906 DLTAFGEQQL VNSGIKFYQR YKAL.ARsW PFIRASGSDR VIASGEKFIE 

A. fumigatus 32239 DLTPFGEQQM VNSGIKFYQK YKAL.AgsW PFIRSSGSDR VIASGEKFIE 

E. nidulans DLTiFGENQM VDSGaKFYRR YKnL . ARknt PFIRASGSDR WASAEKFIN 

T. thermophilus DLTPFGENQM IQlGIKFYnH YKSL.ARnaV PFVRCSGSDR VIASGrlFIE 
T. lanuginosus NLTRFGEEQM MESGrQFYHR YREq.ARelV PFVRAAGSAR VIASAEf Fnr 

M, thermophila ELTRtGQQQM VNSGI KFYRR YRAL.ARksI P FVRTAGqD R WhSAENFtQ 

Basidio DLvPFGAxQs sQAGqEaFtR YsxLvSxdnL PFVRASGSDR WDSAtNWtA 

Consensus DLTPFGEQQM VNSGI KFYRR YKAL-AR-IV PFVRASGSDR VIASAEKFIE 

FcplO DLTPFGEQQM VNSGIKFYRR YKAL . ARKIV PFVRASGSDR VIASAEKFIE 



Fig. 4a 



A. terreus 9al 

A. terreus cbs 

A. niger var. awamori 

A. niger NRRL3135 

A. fumigatus 13073 

A. fumigatus 32722 

A. fumigatus 58128 

A. fumigatus 26906 

A. fumigatus 32239 

i?. .nidulans 

T. thermophilus 

T. lanuginosus 

M. thermophila 

Basidio 

Consensus 
FcplO 
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151 200 
GFQTARqDDh hAnphQPSPr VDValPEGsA YNNTLEHSLC TAFEs . . .St 
GFQNARqGDP hAnphQPSPr VDWIPEGtA YNNTLEHSIC TAFEa . . . St 
GFQSTKLkDP rAqpgQSSPk IDWISEAsS sNNTLDpGtC TvFEd. . 
GFQSTKLkDP rAqpgQSSPk IDWISEAsS sNNTLDpGtC TvFEd. . 
GFQqAKLADP gAt.nRAAPa ISVIIPESeT FNNTLDHGVC TkFEa. . 
GFQqAKLADP gAt.nRAAPa ISVIIPESeT FNNTLDHGVC TkFEa.. 
GFQqAKLADP gAt.nRAAPa ISVIIPESeT FNNTLDHGVC TkFEa.. 
GFQqAKLADP gAt.nRAAPa ISVIIPESeT FNNTLDHGVC TkFEa.. 
GFQqANVADP gAt . nRAAPV ISVIIPESeT YNNTLDHSVC TnFEa . . 
GFRkAQLhDh g.s.gQATPV VNVIIPEidG FNWTLDHStC vSFEn...dE 
GFQSAKV1DP hSdkhDAPPt INVIIeEGpS YNNTLDtGsC PvFEd...Ss 

GFQdAKdrDP rSnkdQAePV INVIISEEtG sNNTLDgltC PAaEe 
GFHSA1LADR gStvrPTlPy dmWIPETaG aNNTLHNDLC TAFEegPySt 
GFaxA sxntxxPx LxVILSExg. . NDTLDDNMC PxAG 



.SE 
.SE 
.SQ 
.SQ 
.SQ 
.SQ 
.SE 



. -Ap 



GFQSAKLADP -A QASPV INVIIPEG-G YNNTLDHGLC TAFE--P-SE 

GFQSAKLADP GANPHQASPV INVIIPEGAG YNNTLDHGLC TAFEE SE 



A. terreus 9al 
A. terreus cbs 
A. niger var. awamori 
A. niger NRRL313 5 
A. fumigatus 13 073 
A. fumigatus 32722 
A. fumigatus 58128 
A. fumigatus 26906 

A. fumigatus 32239 

B. nidulans 

T. thermophilus 
T. lanuginosus 
M. thermophila 
Basidio 



201 250 
VGDD a vANFT AVFAPAIaqR LEAdLPGVQL StDDWNLMA MCPFETVS1T 
VGDAaADNFT AVFAPAIakR LEAdLPGVQL SADDWNLMA MCPFETVS1T 
LADtVEANFT AtFAPSIRqR LEndLSGVtL TDtEVtyLMD MCSFDTIStS 
LADtVEANFT AtFvPSIRqR LEndLSGVtL TDtEVtyLMD MCSFDTIStS 
LGDEVAANFT ALFAPdIRAR aEkhLPGVtL TDEDWSLMD MCS FDTVAr T 
LGDEVAANFT ALFAPdIRAR aEkhLPGVtL TDEDWSLMD MCSFDTVArT 
LGDEVAANFT ALFAPdIRAR aEkhLPGVtL TDEDWSLMD MCSFDTVArT 
LGDEVAANFT ALFAPdIRAR aKkhLPGVtL TDEDWSLMD MCSFDTVArT 
LGDEVEANFT ALFAPAIRAR IEkhLPGVQL TDDDWSLMD MCSFDTVArT 
rADEIEANFT AIMGPPIRkR LEndLPGIKL TNENVIyLMD MCSFDTMArT 
gGHDaQEKFA kqFAPAIlEK I KDhLPGVDL AvsDVpyLMD LCPFETLArn 

. DptqpAEFl qVFGPRVlkK ItkhMPGVNL TlEDVplFMD LCPFDTVGsd 
IGDDaQDtYl StFAGPItAR VNAnLPGaNL TDADtVaLMD LCPFETVAsS 
dSDpqxnxWl AVFAPPI t AR LNAaaPGaNL TDxDaxNLxx LCPFETVS . . 



Consensus LGDDVEANFT AVFAPPIRAR LEA-LPGVNL TDEDWNLMD MCPFDTVA-T 
FcplO LGDDVEANFT AVFAPPIRAR LEAHLPGVNL TDEDWNLMD MCP FDTVART 



251 300 

A. terreus 9al dD. .Aht LSPF CDLFTa . . tE WtQYNYLlSL dKYYGYGGGN 

A. terreus cbs dD. .Aht LSPF CDLFTa. .aE WtQYNYLlSL dKYYGYGGGN 

A. niger var. awamori Tv. .DTK LSPF CDLFTH. .dE WiHYDYLQSL kKYYGHGAGN 

A. niger NRRL3135 Tv . . DTK LSPF CDLFTH. .dE WiNYDYLQSL kKYYGHGAGN 

A. fumigatus 13073 SD..ASQ LSPF CQLFTH. ,nE WkKYNYLQSL gKYYGYGAGN 

A. fumigatus 32722 SD. .ASQ LSPF CQLFTH.. nE WkKYNYLQSL gKYYGYGAGN 

A. fumigatus 58128 SD..ASQ LSPF CQLFTH. .nE WkKYNYLQSL gKYYGYGAGN 

A. fumigatus 26906 SD. .ASQ LSPF CQLFTH. .nE WkKYNYLQSL gKYYGYGAGN 

A. fumigatus 32239 AD..ASE LSPF CAIFTH. .nE WkKYDYLQSL gKYYGYGAGN 

E. nidulans AH. .GTE LSPF CAIFTE . . kE W1QYDYLQSL sKYYGYGAGS 

T. thermophilus ht..DT LSPF CALsTQ . . eE WqaYDYYQSL gKYYGnGGGN 

T. lanuginosus PvlfPrQ LSPF CHLFTa. .dD WmaYDYYyTL dKYYSHGGGS 

M. thermophila SsdpATadag ggngrpLSPF CrLFSE..sE WraYDYLQSV gKWYGYGPGN 

Basidio xexxSxF CDLFexxpeE FxaFxYxgdL dKFYGtGyGQ 

Consensus SD--ATQ LSPF CDLFTH E W-QYDYLQSL - KYYGYGAGN 

FcplO SD..ATQ LSPF CDLFTH. .BE WIQYDYLQSL GKYYGYGAGN 



Fig. 4b 
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A. terreus 9al 

A. terreus cbs 

A. niger var. awamori 

A. niger NRRL3135 

A. fumigatus 13 073 

A. fumigatus 32722 

A . fumi^ratus 58128 

A. fumigatus 26906 

A. fumigatus 32239 

E. nidulans 

T . thennopiii 1 us 

T. lanuginosus 

M. thermophila 

Basidio 

Consensus 
FcplO 



301 

PLGPvQGVGW 
PLGPvQGVGW 
PLGPTQGVGY 
PLGPTQGVGY 
PLGPAQGIGF 
PLGPAQGIGF 
PLGPAQGIGF 
PLGPAQGIGF 
PLGPAQGIGF 
PLGPAQGIGF 



350 

aNELMARLTR A.PVHDHTCv NNTLDASPAT FPLNATLYAD 
aNELIARLTR S . PVHDHTCv NNTLDANPAT FPLNATLYAD 
aNELIARLTH S . PVHDDTSS NHTLDSNPAT FPLNSTLYAD 
aNELIARLTH S . PVHDDTSS NHTLDSSPAT FPLNSTLYAD 
tNELIARLTR S . PVQDHTST NsTLvSNPAT FPLNATMYvD 
tNELIARLTR S . PVQDHTST NsTLvSNPAT FPLNATMYvD 
S. PVQDHTST NsTLvSNPAT FPLNATMYvD 
S. PVQDHTST NsTLvSNPAT FPLNATMYvD 
S. PVQDHTST NsTLDSDPAT FPLNATIYvD 
S.PVQDNTST NHTLDSNPAT FPLDrkLYAD 
S.PVQDYTTv NHTLDSNPAT FPLNATLYAD 
AFGPSRGVGF vNELIARMTg NlPVKDHTTv NHTLDdNPET FPLDAvLYAD 
PLGPTQGVGF vNELLARLA . GvPVRDgTST NRTLDGDPrT FPLGrPLYAD 
PLGPvQGVGY iNELLARLTx qa . VRDNTqT NRTLDSSPxT FPLNrTFYAD 

PLGPAQGVGF -NELIARLTH S-PVQDHTST NHTLDSNPAT FPLNATLYAD 
PLGPAQGVGF VNELIARLTH S . PVQDHTST NHTLDSNPAT FPLNATLYAD 



tNELIARLTR 
tNELIARLTR 
tNELIARLTN 
tNELIARLTQ 



PLGPAQGVGF vNELIARMTH 



terreus 9al 
terreus cbs 
niger var. awamori 
niger NRRL313 5 
fumigatus 13 073 
fumigatus 32722 
fumigatus 58128 
fumigatus 26906 
fumigatus 32239 
nidulans 
thermophi 1 us 
lanuginosus 
thermophila 



Consensus 
FcplO 



A. 
A. 
A. 
A. 
A. 
A. 
A. 
A. 
A. 
E. 
T. 
T. 
M. 

Basidio 



351 

FSHDSnLVSI 
FSHDSnLVSI 
FSHDNGIISI 
FSHDNGIISI 
FSHDNSMVSI 
FSHDNSMVSI 
FSHDNSMVSI 
FSHDNSMVSI 
FSHDNGMIPI 
FSHDNSMISI 
FSHDNTMtSI 

FSHDNTMtGI 
FSHDNdMMGV 
FSHDNqMVAI 



FWALGLYNGT aPLSqTSVE . 
FWALGLYNGT kPLSqTTVE . 
LFALGLYNGT kPLSTTTVE . 
LFALGLYNGT kPLSTTTVE. 
FFALGLYNGT ePLSrTSVE . 
FFALGLYNGT gPLSrTSVE. 
FFALGLYNGT ePLSrTSVE. 
FFALGLYNGT ePLSrTSVE. 
FFAMGLYNGT ePLSqTSeE . 
FFAMGLYNGT qPLSmdSVE. 
FaALGLYNGT akLSTTelK. 



400 

. SvsQTDGYA AAWTVP FAAR 
. ditrTDGYA AAWTV P FAAR 
.NitQTDGFS SAWTVPFASR 
.NitQTDGFS SAWTVPFASR 
. SaKElDGYS ASWvVPFGAR 
. SaKElDGYS ASWvVPFGAR 
. SaKElDGYS ASWvVPFGAR 
.SaKElDGYS ASWvVPFGAR 
. StKESNGYS AS WAVP FGAR 
. SiQEmDGYA ASWTVPFGAR 
.SiEETDGYS AAWTVP FGGR 



FsAMGLYNGT kPLSTSklQP pTgAAADGYA AS WTV P FAAR 
LgALGaYDGv pPLdkTA. .R rdpEElGGYA ASWAVPFAAR 
FsAMGLFNqS aPLdPSxpDP nrt Wv TSklVPFSAR 



FSHDNTMVSI FFALGLYNGT -PLSTTSVEP 
FSHDNTMVSI FFALGLYNGT KPLSTTSVE. 



-S-EETDGYA ASWTVPFAAR 
. SIEETDGYA ASWTVPFAAR 



401 450 

A. terreus Sal AYVEMMQC. . ra EKEPL VRVLVNDRVM PLHGCPtDKL 

A. terreus cbs AYIEMMQC. . ra EKQPL VRVLVNDRVM PLHGCAVDNL 

A. niger var. awamori 1YVEMMQC. . Qa EQEPL VRVLVNDRW PLHGCPIDaL 

A. niger NRRL313 5 1YVEMMQC. . Qa EQEPL VRVLVNDRW PLHGCPVDaL 

A. fumigatus 13 073 AYf EtMQC . . Ks EKEPL VRaLINDRW PLHGCDVDKL 

A. fumigatus 32722 AYf EtMQC. Ks EKEPL VRaLINDRW PLHGCDVDKL 

A. fumigatus 58128 AYf EtMQC . Ks EKESL VRaLINDRW PLHGCDVDKL 

A. fumigatus 26906 AYf EtMQC . Ks EKEPL VRaLINDRW PLHGCDVDKL 

A. fumigatus 32239 AYf EtMQC. . Ks EKEPL VRaLINDRW PLHGCAVDKL 

E. nidulans AYf ELMQC. . E KKEPL VRVLVNDRW PLHGCAVDKF 

T. thermophilus AYIEMMQC. Dd sDEPV VRVLVNDRW PLHGCEVDsL 

T. lanuginosus AYVELLRC. . Etetsseeee EG. . .EDEPF VRVLVNDRW PLHGCrVDRW 

M. thermophila iYVEkMRC. . sggggggggg EGrqeKDEeM VRVLVNDRVM TLkGCGaDEr 

Basidio mvVErLxCxx xgtxxxxxxx xxxxxxxxxx VRVLVND a Vq PLEf CGgDxd 

Consensus AYVEMMQC-- E EG EKEPL VRVLVNDRW PLHGCGVDKL 

FcplO AYVEMMQC. . EA EKEPL VRVLVNDRW PLHGCGVDKL 



Fig. 4c 
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A. terreus 9al 

A. terreus cbs 

A. niger var. awamori 

A. niger NRRL3135 

A. fumigatus 13 073 

A. fumigatus 32722 

A. fumigatus 5 8128 

A. fumigatus 26906 

A. fumigatus 32239 

E. nidulans 

T . thermophi lus 

T. lanuginosus 

M. thermophi la 

Basidio 



451 

GRCKrDAFVA GLSFAQAG. . 
GRCKrDDFVE GLSFARAG . . 
GRCtrDsFVr GLSFARSG. . 
GRCtrDsFVr GLSFARSG. . 
GRCK1NDFVK GLSWARSG. . 
GRCK1NDFVK GLSWARSG. . 
GRCK1NDFVK GLSWARSG . . 
GRCK1NDFVK GLSWARSG. . 
GRCK1KDFVK GLSWARSG. . 
GRCt 1DDWVE GLNFARSG. . 
GRCKrDDFVr GLSFARqG. . 

GRCRrDEWIK GLTFARqG. 
GmCtlErFIE SMAFARGN. . 
GxCt 1DAFVE SqxYAReDgq 



482 

GNWADCF — 

GNWAECF ~~ 

GDWAECsA — — 

GDWAECFA 

GNWGECFS 

GNWGECFS 

GNWGECFS — — 
GNWGECFS — — 
GNSEQSFS — — 
GNWKtCFTl- — 
GNWEGCYAas e~ 

. GHWDrCF — 

GKWD1CFA — ~~ 
GDFEKCFAtp xx 



Consensus GRCK-DDFVE GLSFARSG-- GNWEECFA-- 
FcplO GRCKRDDFVE GLSFARSG. . GNWEECFA. , 
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CP-1 

ECO RI MGVFVVLLS IATLFGST17 
TATATGAATTCATGGGCGTGTTCGTCGTGCTACTGTCCATTGCCACCTTGTTCGGTTCCA 

1 + + + + + + 60 

ATATACTTAAGTACCCGCACAAGCAGCACGATGACAGGTAACGGTGGAACAAGCCAAGGT 

SGTALGPRGNSHSCDTVDGG37 
CATCCGGTACCGCCTTGGGTCCTCGTGGTAATTCTCACTCTTGTGACACTGTTGACGGTG 

61 + + + + + + 120 

GTAGGCCATGGCGGAACCCAGGAGCACCATTAAGAGTGAGAAGACTGTGACAACTGCCAC 

CP-2 

CP-3.10 

YQCFPEISHLWGQYSPFFSL57 
GTTACCAATGTTTCCCAGAAATTTCTCACTTGTGGGGTCAATACTCTCCATTCTTCTCTT 

121 + + + + + + 180 

CAATGGTTACAAAGGGTCTTTAAAGAGTGAACACCCCAGTTATGAGAGGTAAGAAGAGAA 

ADESAI S PDVPKGCRVTFVQ 77 
TGGCTGACGAATCTGCTATTTCTCCAGACGTTCCAAAGGGTTGTAGAGTTACTTTCGTTC 

181 + + + + + + 240 

ACCGACTGCTTAGACGATAAAGAGGTCTGCAAGGTTTCCCGACATCTCAATGAAAGCAAG 

CP-4.10 

CP-5.10 

VLSRHGARYPTSSKSKKYSA97 
AAGTTTTGTCTAGACACGGTGCTAGATACCCAACTTCTTCTAAGTCTAAGAAGTACTCTG 

241 + + + + + + 300 

TTCAAAACAGATCTGTGCCACGATCTATGGGTTGAAGAAGATTCAGATTCTTCATGAGAC 

LIEAIQKNATAFKGKYAFLK 117 
CTTTGATTGAAGCTATTCAAAAGAACGCTACTGCTTTCAAGGGTAAGTACGCTTTCTTGA 

301 + + + + + + 360 

GAAACTAACTTCGATAAGTTTTCTTGCGATGACGAAAGTTCCCATTCATGCGAAAGAACT 

CP-6 

CP-7.10 

TYNYTLGADDLTPFGEQQMV 137 
AGACTTACAACTACACTTTGGGTGCTGACGACTTGACTCCATTCGGTGAACAACAAATGG 

361 + + + + + + 420 

TCTGAATGTTGATGTGAAACCCACGACTGCTGAACTGAGGTAAGCCACTTGTTGTTTACC 

NSGIKFYRRYKALARKIVPF 157 
TTAACTCTGGTATTAAGTTCTACAGAAGATACAAGGCTTTGGCTAGAAAGATTGTTCCAT 

421 + + + + + + 480 

AATTGAGACCATAATTCAAGATGTCTTCTATGTTCCGAAACCGATCTTTCTAACAAGGTA 

CP-8.10 

CP-9.10 

VRASGSDRVIASAEKFIEGF 177 
TCGTTAGAGCTTCTGGTTCTGACAGAGTTATTGCTTCTGCTGAAAAGTTCATTGAAGGTT 
481 + + + + + + 540 

AGCAATCTCGAAGACCAAGACTGTCTCAATAACGAAGACGACTTTTCAAGTAACTTCCAA 

QSAKLADPGANPHQAS PVIN 197 
TCCAATCTGCTAAGTTGGCTGACCCAGGTGCTAACCCACACCAAGCTTCTCCAGTTATTA 

541 + + + + + + 600 

AGGTTAGACGATTCAACCGACTGGGTCCACGATTGGGTGTGGTTCGAAGAGGTCAATAAT 
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CP-10.10 

CP-11.10 

VI I PEGAGYNNTLDHGLCTA 217 
ACGTTATTATTCCAGAAGGTGCTGGTTACAACAACACTTTGGACCACGGTTTGTGTACTG 

601 + + + + + + 660 

TGCAATAATAAGGTCTTCCACGACCAATGTTGTTGTGAAACCTGGTGCCAAACACATGAC 

FEESELGDDVEANFTAVFAP 237 
CTTTCGAAGAATCTGAATTGGGTGACGACGTTGAAGCTAACTTCACTGCTGTTTTCGCTC 

661 + + + + + + 720 

GAAAGCTTCTTAGACTTAACCCACTGCTGCAACTTCGATTGAAGTGACGACAAAAGCGAG 

CP-12.10 

PIRARLEAHLPGVNLTDEDV 257 
CACCTATTAGAGCTAGATTGGAAGCTCACTTGCCAGGTGTTAACTTGACTGACGAAGACG 

721 + + + + + + 780 

GTGGATAATCTCGATCTAACCTTCGAGTGAACGGTCCACAATTGAACTGACTGGTTCTGC 

CP-13 . 10 

VNLMDMCPFDTVARTSDATQ 277 
TTGTTAACTTGATGGACATGTGTCCATTCGACACTGTTGCTAGAACTTCTGACGCTACTC 

781 + + + + + + 840 

AACAATTGAACTACCTGTACACAGGTAAGCTGTGACAACGATCTTGAAGACTGCGATGAG 

LSPFCDLFTHDEWIQYDYLQ 297 
AATTGTCTCCATTCTGTGACTTGTTCACTCACGACGAATGGATTCAATACGACTACTTGC 

841 + + + + + + 900 

TTAACAGAGGTAAGACACTGAACAAGTGAGTGCTGCTTACCTAAGTTATGCTGATGAACG 

CP-14.10 

CP-15.10 

SLGKYYGYGAGNPLGPAQGV 317 
AATCTTTGGGTAAGTACTACGGTTACGGTGCTGGTAACCCATTGGGTCCAGCTCAAGGTG 

901 + + + + + + 960 

TTAGAAACCCATTCATGATGCCAATGCCACGAC CATTGGGTAACCCAGGTCGAGTTC CAC 

GFVNELIARLTHSPVQDHTS 337 
TTGGTTTCGTTAACGAATTGATTGCTAGATTGACTCACTCTCCAGTTCAAGACCACACTT 

961 + + + + + + 1020 

AACCAAAGCAATTGCTTAACTAACGATCTAACTGAGTGAGAGGTCAAGTTCTGGTGTGAA 

CP-16.10 

CP-17 . 10 

TMHTLDSNPATFPLNATLYA 357 
CTACTAACCACACTTTGGACTCTAACCCAGCTACTTTCCCATTGAACGCTACTTTGTACG 

1021 + + + + + + 1080 

GATGATTGGTGTGAAACCTGAGATTGGGTCGATGAAAGGGTAACTTGCGATGAAACATGC 

DFSHDNTMVS IFFALGLYNG 377 
CTGACTTCTCTCACGACAACACTATGGTTTCTATTTTCTTCGCTTTGGGTTTGTACAACG 

1081 + + + + + + 1140 

GACTGAAGAGAGTGCTGTTGTGATACCAAAGATAAAAGAAGCGAAACCCAAACATGTTGC 

CP-18 ■ 10 

CP-19. 10 

TKPLSTTSVES IEETDGYAA 397 
GTACTAAGCCATTGTCTACTACTTCTGTTGAATCTATTGAAGAAACTGACGGTTACGCTG 

1141 + + + + + + 1200 

CATGATTCGGTAACAGATGATGAAGACAACTTAGATAAGTTCTTTGACTGCCAATGCGAC 
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SWTVPFAARAYVEMMQCEAE 417 
CTTCTTGGACTGTTCCATTCGCTGCTAGAGCTTACGTTGAAATGATGCAATGTGAAGCTG 

1201 + + + + + + 1260 

GAAGAACCTGACAAGGTAAGCCACGATCTCGAATGCAACTTTACTACGTTACACTTCGAC 

CP-20. 10 

CP-21.10 

KEPLVRVLVNDRVVPLHGCG 437 
AAAAGGAACCATTGGTTAGAGTTTTGGTTAACGACAGAGTTGTTCCATTGCACGGTTGTG 

1261 + + + + + + 1320 

TTTTCCTTGGTAACCAATCTCAAAACCAATTGCTGTCTCAACAAGGTAACGTGCCAACAC 

VDKLGRCKRDDFVEGLSFAR 457 
GTGTTGACAAGTTGGGTAGATGTAAGAGAGACGACTTCGTTGAAGGTTTGTCTTTCGCTA 

1321 + + + + + + 1380 

CACAACTGTTCAACCCATCTACATTCTCTCTGCTGAAGCAACTTCCAAACAGAAAGCGAT 

CP-22.10 

SGGNWEECFA* Eco RI 467 
GATCTGGTGGTAACTGGGAAGAATGTTTCGCTTAAGAATTCATATA 

1381 + + + + 1426 

CTAGACCACCATTGACCCTTCTTACAAAGCGAATTCTTAAGTATAT 
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1 

P. involutus (phyAl ) 

P. Involutus {phyA2) 

T. pubescens 

A. pediades 

P . lyci i 

A. terreus 9al KhsdCNSVDh 

A. terreus cbs NhsdCtSVDr 

A. niger var. awamori NqsTCDTVDq 

A. niger T213 NqsSCDTVDq 

A. niger NRRL3135 NqsSCDTVDq 

A. fumigatus ATCC13073 GSkSCDTVDl 

A. fumigatus ATCC32722 GSkSCDTVDl 

A . fumigatus ATCC58128 GSkSCDTVDl 

A. fumigatus ATCC26906 GSkSCDTVDl 

A. fumigatus ATCC32239 GSkACDTVEl 

E. nidulans QNHSCNTaDg 

T. thermophilus DSHS CNTVEg 

T, lanuginosus 

M. thermophila ESRPCDTpDl 



50 

pPaGCQInqV 
pPaGCelnqV 
pPaSCQInqV 
pPKDCKITqV 
pPEGCtVTqV 



-FPipeseqR nWSPYSPYFP LAEykA. . 
-FsipeseqR nWSPYSPYFP LAEykA. . 
-LDvtRDVqQ sWSmYSPYFP aAtyvA. . 
-pffpPQIqD sWAaYTPYYP VqAyTP. . 
-LPipAQnTs nWGPYdPFFP VEpyAA. . 
GYQCf PELSH kWGlYAPYFS LqDESPFPlD VPEDCHITFV 
GYQCf PELSH kWGlYAPYFS LqDESPFPlD VPDDCHITFV 
GYQCf SEtSH LWGQYAPFFS LANESAI SPD VPaGCRVTFa 
GYQCf SEtSH LWGQYAPFFS LANESvISPD VPaGCRVTFa 
GYQCf SEtSH LWGQYAPFFS LANESvISPE VPaGCRVTFa 
GYQCsPAtSH LWGQYSPFFS LEDE1SVSSK LPKDCRITLV 
GYQCsPAtSH LWGQYSPFFS LEDE1SVSSK LPKDCRITLV 
GYQCsPAtSH LWGQYSPFFS LEDElSVSSK LPKDCRITLV 
GYQCsPAtSH LWGQYSPFFS LEDElSVSSK LPKDCRITLV 
GYQCsPGtSH LWGQYSPFFS LEDE1SVSSD LPKDCRVTFV 
GYQCf PNVSH VWGQYSPYFS IEQESAISeD VPhGCeVTFV 
GYQCrPEISH SWGQYSPFFS LADQSEISPD VPQNCKITFV 

. nvDIAR hWGQYSPFFS LAEvSEISPA VPKGCRVeFV 

GFQCgTAISH FWGQYSPYFS VPsElDaS.. I PDDCeVTFa 
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NSHSCDTVD- GYQC-PEISH LWGQYSPFFS LADE SAX SPD VPKGCRVTFV 



P. involutus (phyAl) 

P . invol u tus {phyA2 ) 

T. pubescens 

A. pediades 

P . lyci i 

A. terreus 9al 

A, terreus cbs 

A. niger var. awamori 

A. niger T213 

A. niger NRRL3135 

A. fumigatus ATCC13073 

A. fumigatus ATCC32722 

A. fumigatus ATCC58128 

A. fumigatus ATCC26906 

A. fumigatus ATCC32239 

E. nidulans 

T . th ermophi lus 

T. lanuginosus 

M. thermophila 



51 100 
NT IqRHGARF PTSGaTtRik AgLt KLQgvq n f t DAKFnF I KSFKYdLGns 
NIIqRHGARF PTSGaAtRik AgLsKLQsvq nftDPKFDFI KSFtYdLGTs 
HI IqRHGARF PTSGaAKRiq TaVAKLKaaS nytDPlLAFV tnYtYSLGqD 
NIIqRHGARF PTSGaGtRiq AaVKKLQsak TytDPRLDFL tnYtYTLGhD 
NLIqRHGARW PTSGarsRqy AaVAKIQmar PftDPKYEFL NdFvYkFGvA 
QVLARHGARs PThSKTKaYA AtlAalQKSA TaFpGKYAFL QSYNYSLDSE 
QVLARHGARs PTdS KTKaYA At IAalQKNA TaLpGKYAFL KSYNYSMGSE 
QVLSRHGARY PTeSKGKKYS ALIEelQQNv TtFDGKYAFL KTYNYSLGAD 
QVLSRHGARY PTeSKGKKYS ALIEelQQNv TtFDGKYAFL KTYNYSLGAD 
QVLSRHGARY PTdSKGKKYS ALIEelQQNA TtFDGKYAFL KTYNYSLGAD 
QVLSRHGARY PTSSKSKKYk kLVtalQaNA TdFKGKFAFL KTYNYTLGAD 
QVLSRHGARY PTSSKSKKYk kLVtalQaNA TdFKGKFAFL KTYNYTLGAD 
QVLSRHGARY PTSSKSKKYk kLVtalQaNA TdFKGKFAFL KTYNYTLGAD 
QVLSRHGARY PTSSKSKKYk kLVtalQaNA TdFKGKFAFL KTYNYTLGAD 
QVLSRHGARY PTASKSKKYk kLVtalQKNA Te FKGKFAFL ETYNYTLGAD 
QVLSRHGARY PTeSKSKaYS GLIEalQKNA TsFwGQYAFL ESYNYTLGAD 
QLLSRHGARY PTSSKTElYS qLIsRIQKtA TaYKGyYAFL KdYrYqLGAN 

QVLSRHGARY PTAhKSEvYA ELLQRIQDtA TeFKGDFAFL RdYaYhLGAD 
QVLSRHGARa PTlkRAasYv DLIDRIHhGA isYgPgYEFL RTYDYTLGAD 
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QVLSRHGARY PTSSKSKKYS ALIERIQKNA T-FKGKYAFL KTYNYTLGAD 



Fig. 6a 
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P . involutus {phyAl ) 

P. involutus {phyA2) 

T. pubescens 

A. pediades 

P. lycii 

A. terreus 9al 

A . terreus cbs 

A. niger var. awamori 

A. niger T213 

A. niger NRRL3135 

A. fumigatus ATCC13 073 

A. fumigatus ATCC32722 

A. fumigatus ATCC5 8128 

A. fumigatus ATCC26906 

A. fumigatus ATCC3223 9 

£?. nidula.ns 

T. thermophilus 

T. lanuginosus 

M . therznopiii 1 a 



101 

DLvPFGAaQs 
DLvPFGAaQs 
sLveLGAtQs 
DLvPFGAlQs 



150 



f DAGqEaFaR YskLvSKNnL PFIRAdGSDR WDSAtNWtA 
f DAGLEvFaR YskLvSsDnL PFIRSdGSDR WD T At NW t A 
sEAGqEaFtR YsSLvSaDeL PFVRASGSDR WATANNWtA 
sQAGeEtFQR Ysf LvSKEnL PFVRASSSNR WDSAtNWtE 
DLlPFGANQs hQTGtDMYtR YsTLf EgGdV PFVRAAGdQR WDSStNWtA 
ELTPFGrNQL rDlGaQFYeR YNAL . TRHIn PFVRATDAsR VhESAEKFVE 
NLTPFGrNQL qDlGaQFYRR YDTL. TRHIn PFVRAADSsR VhESAEKFVE 
DLTPFGEQEL VNSGIKFYQR YESL.TRNII PFIRSSGSsR VIASGEKFIE 
DLTPFGEQEL VNSGIKFYQR YESL.TRNII PFIRSSGSsR VIASGEKFIE 
DLTPFGEQEL VNSGIKFYQR YESL.TRNIV PFIRSSGSsR VIASGKKFIE 
DLTPFGEQQL VNSGIKFYQR YKAL.ARSW PFIRASGSDR VIASGEKFIE 
DLTPFGEQQL VNSGIKFYQR YKAL.ARSW PFIRASGSDR VIASGEKFIE 
DLTPFGEQQL VNSGIKFYQR YKAL.ARSW PFIRASGSDR VIASGEKFIE 
DLTAFGEQQL VNSGIKFYQR YKAL.ARSW PFIRASGSDR VIASGEKFIE 
DLTPFGEQQM VNSGIKFYQK YKAL.AgSW PFIRSSGSDR VIASGEKFIE 
DLTiFGENQM VDSGaKFYRR YKnL.ARKnt PFIRASGSDR WASAEKFIN 
DLTPFGENQM IQlGIKFYnH YKSL.ARNaV PFVRCSGSDR VIASGrlFIE 

NLTRFGEEQM MESGrQFYHR YREq.AREIV PFVRAAGSAR VIASAEf Fnr 
ELTRtGQQQM VNSGIKFYRR YRAL.ARKsI PFVRTAGqDR WhSAENFtQ 
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DLTPFGENQM VNSGIKFYRR YKAL - ARNI V PFVRASGSDR VIASAEKFIE 



P. ij2vo2utus (phyAl) 

P. involutus {phyA2) 

T. pubescens 

A. pediades 

P . lyci i 

A. terreus 9al 

A. terreus cbs 

A. niger var. awamori 

A. niger T213 

A. niger NRRL3135 

A. fumigatus ATCC13073 

A. fumigatus ATCC32722 

A. fumigatus ATCC58128 

A. fumigatus ATCC26906 

A. fumigatus ATCC3223 9 

E. nidulans 

T. thermophilus 

T. lanuginosus 

M. thermophila 



151 

GFaSA. 
GFaSA. 
GFalA. 
GFsAA. 
GFgdA. 



200 



..shNtvqPk LNLILPQ . . T gNDTLEDNMC PAaGD. . 
..srNaiqPk LDLILPQ . . T gNDTLEDNMC PAaGE . . 
. .ssNsiTPV LSVIISE. .A gNDTLDDNMC PAaGD . . 
..shHvlNPI LfVILSE..S LNDTLDDAMC PnaGs . . 
..sgEtvlPt LQWLQE..E gNcTLcNNMC PnevD.. 
GFQTARqDDh hAnpHQPSPr VDValPEGSA YNNTLEHSLC TAFEs . . .ST 
GFQNARqGDP hAnpHQPSPr VDWIPEGTA YNNTLEHSIC TAFEA. . .ST 
GFQSTKLkDP rAqpgQSSPk IDWISEASS sNNTLDpGtC TvFED...Se 
GFQSTKLkDP rAqpgQSSPk IDWISEASS sNNTLDpGtC TvFED. . .Se 
GFQSTKLkDP rAqpgQSSPk IDWISEASS sNNTLDpGtC TvFED... Se 
GFQqAKLADP gAt.NRAAPa ISVIIPESeT FNNTLDHGV C TkFEA. . . Sq 
GFQqAKLADP gAt.NRAAPa ISVIIPESeT FNNTLDHGVC TkFEA... Sq 
GFQqAKLADP gAt.NRAAPa ISVIIPESeT FNNTLDHGVC TkFEA... Sq 
GFQqAKLADP gAt.NRAAPa ISVIIPESeT FNNTLDHGVC TkFEA... Sq 
GFQqANVADP g At . NRAAPV ISVIIPESeT YNNTLDHSVC TnFEA . . . Se 
GFRkAQLhDh g.s.gQATPV VNVIIPEidG FNNTLDHStC vSFEN...de 
GFQSAKV1DP hSdkHDAPPt INVIIeEGPS YNNTLDtGsC PvFED...SS 

GFQdAKdrDP rSnkDQAePV INVIISEETG sNNTLDgltC PAaEE...AP 
GFHSA1LADR gStvRPTlPy dmWIPETAG aNNTLHNDLC TAFEEgpyST 
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GFQSAKLADP -A--HQASPV INVIIPEGSG YNNTLDHGL C TAPED ST 



Fig. 6b 
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P. involutus {phyAl) 

P. involutus {phyA2) 

T. pubescens 

A. pediades 

P . lyci i 

A. terreus 9al 

A. terreus cbs 

A. niger var. awamori 

A. niger T213 

A. niger NRRL3135 

A. fumigatus ATCC13073 

A. fumigatus ATCC32722 

A. fumigatus ATCC58128 

A. fumigatus ATCC26906 

A. fumigatus ATGC32239 

S. nidularzs 

T. thermophilus 

T. lanuginosus 

M . thermophi 1 a 



201 250 
.SDpqvnaWl AVafPSItAR LNAaaPSVNL TDtDaf NLVs LCAF1TVSK. 
.SDpqvDaWl AsafPSVtAQ LNAaaPGaNL TDADafNLVs LCPFmTVSK. 
.SDpqvnQWl AqFAPPMtAR LNAgaPGaNL TDtDtyNLLt LCPFETVAt . 
.SDpqtGiWT SIYGTPIanR LNqqaPGaNI TAADVsNLIp LCAFETIvK. 
.GDESt.tWl GVFAPnltAR LNAaaPSaNL SDsDaLtLMD MCPFDTLSs . 
VGDDAvANFT AVFAPAIaqR LEAdLPGVQL StDDWNLMA MCPFETVS1T 
VGDAAADNFT AVFAPAIakR LEAdLPGVQL SADDWNLMA MCPFETVSlT 
LADtvEANFT AtFAPSIRqR LEndLSGVtL TDtEVtyLMD MCSFDTIStS 
LADtvEANFT AtFAPSIRqR LEndLSGVtL TDtEVtyLMD MCSFDTIStS 
LADtvEANFT AtFvPSIRqR LEndLSGVtL TDtEVtyLMD MCSFDTIStS 
LGDEvAANFT ALFAPdIRAR aEkhLPGVtL TDEDWSLMD MCSFDTVART 
LGDEvAANFT ALFAPdIRAR aEkhLPGVtL TDEDWSLMD MCSFDTVART 
LGDEvAANFT ALFAPdIRAR aEkhLPGVtL TDEDWSLMD MCSFDTVART 
LGDEvAANFT ALFAPdIRAR aKkhLPGVtL TDEDWSLMD MCSFDTVART 
LGDEvEANFT ALFAPAI RAR IEkhLPGVQL TDDDWSLMD MCSFDTVART 
rADEiEANFT AIMGPPIRkR LEndLPGIKL TNENVIyLMD MCS FDTMART 
gGHDAQEKFA kqFAPAIlEK IKDhLPGVDL AvsDVpyLMD LCPFETLARn 

. DptqpAEFl qVFGPRVlkK ItkhMPGTOL TlEDVplFMD LCPFDTVGsd 
IGDDAQDtYl StFAGPItAR VNAnL PGaNL TDAD t VaLMD LCPFETVAsS 
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LGDDAEANFT AVFAPPIRAR LEA-LPGVKL TDEDWNLMD MCPFDTVART 



251 300 

P. involutus {phyAl) ekkSdF CtLFegiPGs FeaFAYggdL dKFYGtGyGQ 

P. involutus {phyA2) eqkSdF CtLFegiPGs FeaFAYagdL dKFYGtGyGQ 

T. pubescens errSeF CDIYeelqAE .daFAYnadL dKFYGtGyGQ 

A. pediades etpSPF CNLF. .TPEE FaQFEYFgdL dKFYGtGyGQ 

P. lycii gnaSPF CDLF . . TAEE YvsYEYYydL dKYYGtGPGN 

A. terreus 9al dD..Aht LSPF CDLF. .TAtE WtQYNYLlSL dKYYGYGGGN 

A. terreus Cbs dD. .Aht LSPF CDLF. .TAAE WtQYNYLlSL dKYYGYGGGN 

A. niger var. awamori Tv. .DTK LSPF CDLF. .ThDE WiHYDYLQSL kKYYGHGAGN 

A. niger T213 Tv. .DTK LSPF CDLF. .ThDE WiHYDYLRSL kKYYGHGAGN 

A. niger NRRL3135 Tv. .DTK LSPF CDLF. .ThDE WiNYDYLQSL kKYYGHGAGN 

A. fumigatus ATCC13073 SD. .ASQ LSPF CQLF. .ThNE WkKYNYLQSL gKYYGYGAGN 

A. fumigatus ATCC32722 SD. .ASQ LSPF CQLF. .ThNE WkKYNYLQSL gKYYGYGAGN 

A. fumigatus ATCC58128 SD. .ASQ LSPF CQLF. .ThNE WkKYNYLQSL gKYYGYGAGN 

A. fumigatus ATCC26906 SD. .ASQ LSPF CQLF.. ThNE WkKYNYLQSL gKYYGYGAGN 

A. fumigatus ATCC32239 AD . . ASE LSPF CAIF. .ThNE WkKYDYLQSL gKYYGYGAGN 

E. nidulans AH. .GTE LSPF CAIF..TEKE WlQYDYLQSL SKYYGYGAGS 

T. thermophilus ht..DT LSPF CALs . . TqEE WqaYDYYQSL gKYYGnGGGN 

T. lanuginosus PvlfPrQ LSPF CHLF. .TADD WmaYDYYyTL dKYYSHGGGS 

AT. thermophila SsdpATadag ggngrpLSPF CrLF. .SEsE WraYDYLQSV gKWYGYGPGN 
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SD--ATQ- 



--LSPF CDLF- -TADE W-QYDYLQSL -KYYGYGAGN 



Fig. 6c 
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P. involutus iphyAl) 

P. involutus (phyA2) 

T. pubescens 

A. pediades 

P. lycii 

A. terreus 9al 

A. terreus cbs 

A. niger var. awamori 

A. niger T213 

A. rzigrer NRRL3135 

A. fumigatus ATCC13 073 

A. fumigatus ATCC32 722 

A. fumigatus ATCC58128 

A. fumigatus ATCC26906 

A. fumigatus ATCC3223 9 

nidulans 
T. thermophilus 
T. lanuginosus 
M. thermophila 



301 350 
eLGPvQGVGY vNELIARLTN S.AVRDNTqT NRTLDASPvT F PLNkT F YAD 
ALGPvQGVGY iNELLARLTN S.AVNDNTqT NRTLDAaPDT FPLNkTMYAD 
PLGPvQGVGY iNELIARLTa q.nVsDHTqT NsTLDSSPET FPLNrTLYAD 
PLGPvQGVGY iNELLARLTE m . PVRDNTqT NRTLDSSP1T FPLDrSIYAD 
ALGPvQGVGY vNELLARLTg q . AVRDETqT NRTLDSDPAT FPLNrTFYAD 
PLGPvQGVGW aNELMARLTR A.PVHDHTCv NNTLDASPAT FPLMATLYAD 
PLGPvQGVGW aNELIARLTR S . PVHDHTCv NNTLDANPAT FPLNATLYAD 
PLGPTQGVGY aNELIARLTH S.PVHDDTSS NHTLDSNPAT FPLNSTLYAD 
PLGPTQGVGY aNELIARLTH S.PVHDDTSS NHTLDSNPAT FPLNSTLYAD 
PLGPTQGVGY aNELIARLTH S.PVHDDTSS NHTLDSSPAT FPLNSTLYAD 
PLGPAQGIGF tNELIARLTR S . PVQDHTST NsTLvSNPAT FPLNATMYvD 
PLGPAQGIGF tNELIARLTR S . PVQDHTST NsTLvSNPAT FPLNATMYvD 
PLGPAQGIGF tNELIARLTR S . PVQDHTST NsTLvSNPAT FPLNATMYvD 
PLGPAQGIGF tNELIARLTR S . PVQDHTST NsTLvSNPAT FPLNATMYvD 
PLGPAQGIGF tNELIARLTN S . PVQDHTST NsTLDSDPAT FPLNATIYvD 
PLGPAQGIGF tNELIARLTQ S . PVQDNTST NHTLDSNPAT FPLDrkLYAD 
PLGPAQGVGF vNELIARMTH S . PVQDYTTv NHTLDSNPAT FPLNATLYAD 

AFGPSRGVGF vNELIARMTg NlPVKDHTTv NHTLDdNPET FPLDAvLYAD 
PLGPTQGVGF vNELLARLA. GvPVRDgTST NRTLDGDPrT FPLGrPLYAD 
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PLGPAQGVGF -NELIARLTH S -PVQDHTST NHTLDSNPAT FPLNATLYAD 



P. involutus iphyAl) 

P. involutus [phyA2) 

T. pubescens 

A. pediades 

P. lycii 

A. terreus 9al 

A. terreus cbs 

A. niger var. awamori 

A. niger T213 

A. niger NRRL3135 

A. fumigatus ATCC13073 

A. fumigatus ATCC32722 

A. fumigatus ATCC58128 

A. fumigatus ATCC26906 

A. fumigatus ATCC32239 

E. nidulans 

T, thermophilus 

T. lanuginosus 

M. thermophila 



351 400 

FSHDNl MVAV FsAMGLFrqP aPLSTSvpNP wrt Wr TSS1VPFSGR 

FSHDN1MVAV FsAMGLFrqS aPLSTSTpDP nrt Wl TSSvVPFSAR 

F S HDNqMVA I FsAMGLFNqS aPLdPTTpDP art Fl vkkiVPFSAR 

LSHDNqMIAI FsAMGLFNqS sPLdPSfpNP krt Wv TSRltPFSAR 

FSHDNTMVPI F a ALGLFNAT a . LdPlkpDe nrl Wv DSklVPFSGH 

FSHDSnLVSI FWALGLYNGT aPLSqTSVES Vs . . QTDGYA AAWTVPFAAR 
FSHDSnLVSI FWALGLYNGT KPLSqTTVEd It.. r TDGYA AAWTVPFAAR 
FSHDNGIISI LFALGLYNGT KPLSTTTVEN It.. QTDGFS SAWTVPFASR 
FSHDNGIISI LFALGLYNGT KPLSTTTVEN It.. QTDGFS SAWTVPFASR 
FSHDNGIISI LFALGLYNGT KPLSTTTVEN It.. QTDGFS SAWTVPFASR 
FSHDNSMVSI FFALGLYNGT EPLSrTSVES ak. .ElDGYS ASWvVPFGAR 
FSHDNSMVSI FFALGLYNGT gPLSrTSVES ak . . ElDGYS ASWvVPFGAR 
FSHDNSMVSI FFALGLYNGT EPLSrTSVES ak.. ElDGYS ASWvVPFGAR 
FSHDNSMVSI FFALGLYNGT EPLSrTSVES ak.. ElDGYS ASWvVPFGAR 
FSHDNGMIPI FFAMGLYNGT EPLSqTSeES tk. .ESNGYS ASWAVPFGAR 
FSHDNSMISI FFAMGLYNGT QPLSradSVES Iq. .EmDGYA ASWTVPFGAR 
FSHDNTMtSI FaALGLYNGT akLSTTelKS I e . . ETDGYS AAWTVPFGGR 

FSHDNTMtGI FsAMGLYNGT KPLSTSklQP ptgaAADGYA ASWTVPFAAR 
FSHDNdMMGV LgALGaYDGv pPLdkTArrd . . peElGGYA ASWAVPFAAR 
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FSHDNTMVSI FFALGLYNGT KPLSTTSVES I ETDGYA ASWTVPFAAR 



Fig. 6d 
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MGVFVVLLS IATLFGSTSGT 20 
ATGGGCGTGTTCGTCGTGCTACTGTCCATTGCCACCTTGTTCGGTTCCACATCCGGTACC 

! + + + + + + 60 

TACCCGCACAAGCAGCACGATGACAGGTAACGGTGGAACAAGCCAAGGTGTAGGCCATGG 

ALGPRGNSHS CDTVDGGYQC 40 
GCCTTGGGTCCTCGTGGTAATTCTCACTCTTGTGACACTGTTGACGGTGGTTACCAATGT 

61 + + + + + + 120 

CGGAACCCAGGAGCACCATTAAGAGTGAGAACACTGTGACAACTGCCACCAATGGTTACA 

FPEISHLWGTYSPYFSLADE 60 
TTCCCAGAAATTTCTCACTTGTGGGGTACCTACTCTCCATACTTCTCTTTGGCAGACGAA 

121 + + + + + + 180 

AAGGGTCTTTAAAGAGTGAACACCCCATGGATGAGAGGTATGAAGAGAAACCGTCTGCTT 

SAISPDVPDDCRVTFVQVLS 80 
TCTGCTATTTCTCCAGACGTTCCAGACGACTGTAGAGTTACTTTCGTTCAAGTTTTGTCT 

187 + + + + + + 240 

AGACGATAAAGAGGTCTGCAAGGTCTGCTGACATCTCAATGAAAGCAAGTTCAAAACAGA 

RHGARYPTSSASKAYSALIE 100 
AGACACGGTGCTAGATACCCAACTTCTTCTGCGTCTAAGGCTTACTCTGCTTTGATTGAA 

24i + + + + + + 300 

TCTGTGCCACGATCTATGGGTTGAAGAAGACGCAGATTCCGAATGAGACGAAACTAACTT 

AI QKNATAF KGKYAFLKTYN 120 
GCTATTCAAAAGAACGCTACTGCTTTCAAGGGTAAGTACGCTTTCTTGAAGACTTACAAC 

301 + + + + + + 360 

CGATAAGTTTTCTTGCGATGACGAAAGTTCCCATTCATGCGAAAGAACTTCTGAATGTTG 

YTLGADDLTPFGENQMVNSG 140 
TACACTTTGGGTGCTGACGACTTGACTCCATTCGGTGAAAACCAAATGGTTAACTCTGGT 

361 + + + + + + 420 

ATGTGAAACCCAGGACTGCTGAACTGAGGTAAGCCACTTTTGGTTTACCAATTGAGACCA 

IKFYRRYKALARKIVPFIRA 160 
ATTAAGTTCTACAGAAGATACAAGGCTTTGGCTAGAAAGATTGTTCCATTCATTAGAGCT 

421 + + + + + + 480 

TAATTCAAGATGTCTTCTATGTTCCGAAACCGATCTTTCTAACAAGGTAAGTAATCTCGA 

SGSDRVIASAEKFIEGFQSA 180 
TCTGGTTCTGACAGAGTTATTGCTTCTGCTGAAAAGTTCATTGAAGGTTTCCAATCTGCT 

481 + + + + + + 54 0 

AGACCAAGACTGTCTCAATAACGAAGACGACTTTTCAAGTAACTTCCAAAGGTTAGACGA 

KLADPGSQPHQASPVINVI I 200 
AAGTTGGCTGACCCAGGTTCTCAACCACACCAAGCTTCTCCAGTTATTAACGTGATCATT 

541 + + + + + + 600 

TTCAACCGACTGGGTCCAAGAGTTGGTGTGGTTCGAAGAGGTCAATAATTGCACTAGTAA 

PEGSGYNNTLDHGTCTAFED 220 
CCAGAAGGATCCGGTTACAACAACACTTTGGACCACGGTACTTGTACTGCTTTCGAAGAC 

601 + + + + + + 660 

GGTCTTCCTAGGCCAATGTTGTTGTGAAACCTGGTGCCATGAACATGACGAAAGCTTCTG 



Fig. 7a 
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SELGDDVEANFTALFAPAIR 240 
TCTGAATTAGGTGACGACGTTGAAGCTAACTTCACTGCTTTGTTCGCTCCAGCTATTAGA 

661 + + + + + + 720 

AGACTTAATCCACTGCTGCAACTTCGATTGAAGTGACGAAACAAGCGAGGTCGATAATCT 

ARLEADLPGVTLTDEDVVYL 260 
GCTAGATTGGAAGCTGACTTGCCAGGTGTTACTTTGACTGACGAAGACGTTGTTTACTTG 

721 + + + + + + 780 

CGATCTAACCTTCGACTGAACGGTCCACAATGAAACTGACTGCTTCTGCAACAAATGAAC 

MDMCPFDTVARTSDATELSP 280 
ATGGACATGTGTCCATTCGACACTGTCGCTAGAACTTCTGACGCTACTGAATTGTCTCCA 

781 + + + + + + 840 

TACCTGTACACAGGTAAGCTGTGACAGCGATCTTGAAGACTGCGATGACTTAACAGAGGT 

FCALFTHDEWIQYDYLQSLG 300 
TTCTGTGCTTTGTTCACTCACGACGAATGGATCCAATACGACTACTTGCAAAGCTTGGGT 

841 + + + + + + 900 

AAGACACGAAACAAGTGAGTGCTGCTTACCTAGGTTATGCTGATGAACGTTTCGAACCCA 

KYYGYGAGNPLGPAQGVGFA 320 
AAGTACTACGGTTACGGTGCTGGTAACCCATTGGGTCCAGCTCAAGGTGTTGGTTTCGCT 

901 + + + + + + 960 

TTCATGATGCCAATGCCACGACCATTGGGTAACCCAGGTCGAGTTCCACAACCAAAGCGA 

NELIARLTHSPVQDHTSTNH 340 
AACGAATTGATTGCTAGATTGACTCACTCTCCAGTTCAAGACCACACTTCTACTAACCAC 

961 + + + + + + 1020 

TTGCTTAACTAACGATCTAACTGAGTGAGAGGTCAAGTTCTGGTGTGAAGATGATTGGTG 

TLDSNPATFPLNATLYADFS 360 
ACTTTGGACTCTAACCCAGCTACTTTCCCATTGAACGCTACTTTGTACGCTGACTTCTCT 

1021 + + + + + + 1080 

TGAAACCTGAGATTGGGTCGATGAAAGGGTAACTTGCGATGAAACATGCGACTGAAGAGA 

HDNTMIS IFFALGLYNGTKP 380 
CACGACAACACTATGATATCTATTTTCTTCGCTTTGGGTTTGTACAACGGTACCAAGCCA 

1081 + + + + + + 1140 

GTGCTGTTGTGATACTATAGATAAAAGAAGCGAAACCCAAACATGTTGCCATGGTTCGGT 

LSTTSVES IEETDGYSASWT 400 
TTGTCTACTACTTCTGTTGAATCTATTGAAGAAACTGACGGTTACTCTGCTTCTTGGACT 

1141 + + + + + + 1200 

AACAGATGATGAAGACAACTTAGATAACTTCTTTGACTGCCAATGAGACGAAGAACCTGA 

VPFAARAYVEMMQCQAEKEP 420 
GTTCGATTCGCTGCTAGAGCTTACGTTGAAATGATGCAATGTCAAGCTGAAAAGGAACCA 

1201 + + + + + + 1260 

CAAGGTAAGCGACGATCTCGAATGCAACTTTACTACGTTACAGTTCGACTTTTCCTTGGT 

LVRVLVNDRVVPLHGCAVDK 440 
TTGGTTAGAGTTTTGGTTAACGACAGAGTTGTTCCATTGCACGGTTGTGCTGTTGACAAG 

1261 + + + + + + 1320 

AACCAATCTCAAAACCAATTGCTGTCTCAACAAGGTAACGTGCCAACACGACAACTGTTC 



Fig. 7b 
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LGRCKRDDFVEGLSFARSGG 460 
TTGGGTAGATGTAAGAGAGACGACTTCGTTGAAGGTTTGTCTTTCGCTAGATCTGGTGGT 

1321 + + + + + + 1380 

AACCCATCTACATTCTCTCTGCTGAAGCAACTTCCAAACAGAAAGCGATCTAGACCACCA 



UWAECFA* 467 
AACTGGGCTGAATGTTTCGCTTAA 

1381 + + 1410 

TTGACCCGACTTACAAAGCGAATT 



Fig. 7c 
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MGVFVVLLS IATLFGSTSGT 20 
ATGGGCGTGTTCGTCGTGCTACTGTCCATTGCCACCTTGTTCGGTTCCACATCCGGTACC 

1 + + + + + + 60 

TACCCGCACAAGCAGCACGATGACAGGTAACGGTGGAACAAGCCAAGGTGTAGGCCATGG 



ALGPRGNSHS CDTVDGGYQC 40 
GCCTTGGGTCCTCGTGGTAACTCTCACTCTTGTGACACTGTTGACGGTGGTTACCAATGT 

61 + + + + + + 120 

CGGAACCCAGGAGCACCATTGAGAGTGAGAACACTGTGACAACTGCCACCAATGGTTACA 



FPEISHLWGTYSPFFSLADE 60 
TTCCCAGAAATTTCTCACTTGTGGGGTACATACTCTCCATTCTTCTCTTTGGCTGACGAA 

121 + + + + + + 180 

AAGGGTCTTTAAAGAGTGAACACCCCATGTATGAGAGGTAAGAAGAGAAACCGACTGCTT 



SAISPDVPKGCRVTFVQVLS 80 
TCTGCTATTTCTCCAGACGTTCCAAAGGGTTGTAGAGTTACTTTCGTTCAAGTTTTGTCT 

181 + + + + + + 240 

AGACGATAAAGAGGTCTGCAAGGTTTCCCAACATCTCAATGAAAGCAAGTTCAAAACAGA 



RHGARYPTS SASKAYSALIE 100 
AGACACGGTGCTAGATACCCAACTTCTTCTGCGTCTAAGGCGTACTCTGCTTTGATTGAA 

241 + + + + + + 300 

TCTGTGCCACGATCTATGGGTTGAAGAAGACGCAGATTCCGCATGAGACGAAACTAACTT 



AI QKMATAFKGKYAFLKTYN" 120 
GCTATTCAAAAGAACGCTACTGCTTTCAAGGGTAAGTACGCTTTCTTGAAGACTTACAAC 

301 + + + + + + 360 

CGATAAGTTTTCTTGCGATGACGAAAGTTCCCATTCATGCGAAAGAACTTCTGAATGTTG 

YTLGADDLTPFGEQQMVNSG 140 
TACACTTTGGGTGCTGACGACTTGACTCCATTCGGTGAACAACAAATGGTTAACTCTGGT 

361 + + + + + + 420 

ATGTGAAACCCACGACTGCTGAACTGAGGTAAGCCACTTGTTGTTTACCAATTGAGACCA 



IKFYRRYKALARKIVPFIRA 160 
ATTAAGTTCTACAGAAGATACAAGGCTTTGGCTAGAAAGATTGTTCCATTCATTAGAGCT 

421 + + + + + + 480 

TAATTCAAGATGTCTTCTATGTTCCGAAACCGATCTTTCTAACAAGGTAAGTAATCTCGA 



SGSDRVIASAEKFIEGFQSA 180 
TCTGGTTCTGACAGAGTTATTGCTTCTGCTGAAAAGTTCATTGAAGGTTTCCAATCTGCT 

481 + + + + + + 540 

AGACCAAGACTGTCTCAATAACGAAGACGACTTTTCAAGTAACTTCCAAAGGTTAGACGA 



KLADPGANPHQAS PVINVI I 200 
AAGTTGGCTGACCCAGGTGCTAACCCACACCAAGCTTCTCCAGTTATTAACGTTATTATT 

541 + + + + + + 600 

TTCAACCGACTGGGTCCACGATTGGGTGTGGTTCGAAGAGGTCAATAATTGCAATAATAA 



PEGAGYNNTLDHGLCTAFEE 220 
CCAGAAGGTGCTGGTTACAACAACACTTTGGACCACGGTTTGTGTACTGCTTTCGAAGAA 

601 + + + + + + 660 

GGTCTTCCACGACCAATGTTGTTGTGAAACCTGGTGCCAAACACATGACGAAAGCTTCTT 

Fig. 8a 
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SELGDDVEANFTAVFAPPIR 240 
TCTGAATTGGGTGACGACGTTGAAGCTAACTTCACTGCTGTTTTCGCTCCACCAATTAGA 

661 + + + + + + 720 

AGACTTAACCCACTGCTGCAACTTCGATTGAAGTGACGACAAAAGCGAGGTGGTTAATCT 

ARLEAHLPGVNLTDEDVVNL 260 
GCTAGATTGGAAGCTCACTTGCCAGGTGTTAACTTGACTGACGAAGACGTTGTTAACTTG 

721 + + + + + + 780 

CGATCTAACCTTCGAGTGAACGGTCCACAATTGAACTGACTGCTTCTGCAACAATTGAAC 

MDMCPFDTVARTSDATQLSP 280 
ATGGACATGTGTCCATTCGACACTGTTGCTAGAACTTCTGACGCTACTCAATTGTCTCCA 

781 + + + + + + 840 

TACCTGTACACAGGTAAGCTGTGACAACGATCTTGAAGACTGCGATGAGTTAACAGAGGT 

FCDLFTHDEWIQYDYLQSLG 300 
TTCTGTGACTTGTTCACTCACGACGAATGGATTCAATACGACTACTTGCAATCTTTGGGT 

841 + + + + + + 900 

AAGACACTGAACAAGTGAGTGCTGCTTACCTAAGTTATGCTGATGAACGTTAGAAACCCA 

KYYGYGAGNPLGPAQGVGFV 320 
AAGTACTACGGTTACGGTGCTGGTAACCCATTGGGTCCAGCTCAAGGTGTTGGTTTCGTT 

901 + + + + + + 960 

TTCATGATGCCAATGCCACGACCATTGGGTAACCCAGGTCGAGTTCCACAACCAAAGCAA 

NELIARLTHS PVQDHTSTNH 340 
AACGAATTGATTGCTAGATTGACTCACTCTCCAGTTCAAGACCACACTTCTACTAACCAC 

951 + + + + + + 1020 

TTGCTTAACTAACGATCTAACTGAGTGAGAGGTCAAGTTCTGGTGTGAAGATGATTGGTG 

TLDSNPATFPLNATLYADFS 360 
ACTTTGGACTCTAACCCAGCTACTTTCCCATTGAACGCTACTTTGTACGCTGACTTCTCT 

1021 + + + + + + 1080 

TGAAACCTGAGATTGGGTCGATGAAAGGGTAACTTGCGATGAAACATGCGACTGAAGAGA 

HDNTMVS I FFALGLYNGTKP 380 
CACGACAACACTATGGTTTCTATTTTCTTCGCTTTGGGTTTGTACAACGGTACTAAGCCA 

1081 + + + + + + H40 

GTGCTGTTGTGATACCAAAGATAAAAGAAGCGAAACCCAAACATGTTGCCATGATTCGGT 

LSTTSVES IEETDGYSASWT 400 
TTGTCTACTACTTCTGTTGAATCTATTGAAGAAACTGACGGTTACTCTGCTTCTTGGACT 

1141 + + + + + + 1200 

AACAGATGATGAAGACAACTTAGATAACTTCTTTGACTGCCAATGAGACGAAGAACCTGA 

VPFAARAYVEMMQCEAEKEP 420 
GTTCCATTCGCTGCTAGAGCTTACGTTGAAATGATGCAATGTGAAGCTGAAAAGGAACCA 

1201 + + + + + + 1260 

CAAGGTAAGCGACGATCTCGAATGCAACTTTACTACGTTACACTTCGACTTTTCCTTGGT 

LVRVLVNDRVVPLHGCGVDK 440 
TTGGTTAGAGTTTTGGTTAACGACAGAGTTGTTCCATTGCACGGTTGTGGTGTTGACAAG 

1261 + + + + + + 1320 

AACCAATCTCAAAACCAATTGCTGTCTCAACAAGGTAACGTGCCAACACCACAACTGTTC 



Fig. 8b 
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LGRCKRDDFVEGLSFARSGG 
TTGGGTAGATGTAAGAGAGACGACTTCGTTGAAGGTTTGTCTTTCGCTAGATCTGGTGGT 

1321 + + + + + + 

AACCCATCTACATTCTCTCTGCTGAAGCAACTTCCAAACAGAAAGCGATCTAGACCACCA 

NWEECFA* 467 
AACTGGGAAGAATGTTTCGCTTAA 

1381 + + 1404 

TTGACCCTTCTTACAAAGCGAATT 



Fig. 8c 
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MGVFVVLLS IATLFGSTSGT 20 
ATGGGGGTTTTCGTCGTTCTATTATCTATCGCGACTCTGTTCGGCAGCACATCGGGCACT 

X + + + + + + 60 

TACCCCCAAAAGCAGCAAGATAATAGATAGCGCTGAGACAAGCCGTCGTGTAGCCCGTGA 

ALGPRGNHSKSCDTVDLGYQ 40 
GCGCTGGGCCCCCGTGGAAATCACTCCAAGTCCTGCGATACGGTAGACCTAGGGTACCAG 

51 + + + + + + 120 

CGCGACCCGGGGGCACCTTTAGTGAGGTTCAGGACGCTATGCCATCTGGATCCCATGGTC 

CSPATSHLWGTYSPYFSLED 60 
TGCTCCCCTGCGACTTCTCATCTATGGGGCACGTACTCGCCATaCTTTTCGCTCGAGGAC 

121 + + + + + + 180 

ACGAGGGGACGCTGAAGAGTAGATACCCCGtgCATGAGCGGTAtGAAAAGCGAGCTCCTG 

ELSVS SKLPKDCRITLVQVL 80 
GAGCTGTCCGTGTCGAGTAAGCTTCCCAAGGATTGCCGGATCACCTTGGTACAGGTGCTA 

181 + + + + + + 240 

CTCGACAGGCACAGCTCATTCGAAGGGTTCCTAACGGCCTAGTGGAACCATGTCCACGAT 

SRHGARYPTSSKSKKYKKLI 100 
TCGCGCCATGGAGCGCGGTACCCAACCAGCTCCAAGAGCAAAAAGTATAAGAAGCTTaTt 

241 + + + + + -f 300 

AGCGCGGTACCTCGCGCCATGGGTTGGTCGAGGTTCTCGTTTTTCATATTCTTCGAAtAa 

TAIQANATDFKGKYAFLKTY 120 
ACGGCGATCCAGGCCAATGCCACCGACTTCAAGGGCAAGTacGCCTTTTTGAAGACGTAC 

301 + + + + + + 360 

TGCCGCTAGGTCCGGTTACGGTGGCTGAAGTTCCCGTTCAtgCGGAAAAACTTCTGCATG 

NYTLGADDLTPFGEQQLVNS 140 
AACTATACTCTGGGTGCGGATGACCTCACTCCCTTTGGGGAGCAGCAGCTGGTGAACTCG 

361 + + + + + + 420 

TTGATATGAGACCCACGCCTACTGGAGTGAGGGAAACCCCTCGTCGTCGACCACTTGAGC 

GIKFYQRYKALARSVVPFIR 160 
GGCATCAAGTTCTACCAGAGGTACAAGGCTCTGGCGCGCAGTGTGGTGCCGTTTATTCGC 

421 + + + + + + 480 

CCGTAGTTCAAGATGGTCTCCATGTTCCGAGACCGCGCGTCACACCACGGCAAATAAGCG 

ASGSDRVIASGEKFIEGFQQ 180 
GCCTCAGGCTCGGACCGGGTTATTGCTTCGGGAGAGAAGTTCATCGAGGGGTTCCAGCAG 

461 + + + + + + 540 

CGGAGTCCGAGCCTGGCCCAATAACGAAGCCCTCTCTTCAAGTAGCTCCCCAAGGTCGTC 

AKLADPGATNRAAPAISVI I 200 
GCGAAGCTGGCTGATCCTGGCGCGACGAACCGCGCCGCTCCGGCGATTAGTGTGATTATT 

541 + + + + + + 600 

CGCTTCGACCGACTAGGACCGCGCTGCTTGGCGCGGCGAGGCCGCTAATCACACTAATAA 

PESETFNNTLDHGVCTKFEA 220 
CCGGAGAGCGAGACGTTCAACAATACGCTGGACCACGGTGTGTGCACGAAGTTTGAGGCG 

601 + + + + + + 660 

GGCCTCTCGCTCTGCAAGTTGTTATGCGACCTGGTGCCACACACGTGCTTCAAACTCCGC 



Fig. 9a 
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SQLGDEVAANFTALFAPDIR 240 
AGTCAGCTGGGAGATGAGGTTGCGGCCAATTTCACTGCGCTCTTTGCACCCGACATCCGA 

66i + + + + + + 720 

TCAGTCGACCCTCTACTCCAACGCCGGTTAAAGTGACGCGAGAAACGTGGGCTGTAGGCT 

ARLEKHLPGVTLTDEDVVSL 260 
GCTCGCctCGAGAAGCATCTTCCTGGCGTGACGCTGACAGACGAGGACGTTGTCAGTCTA 

721 + + + + + + 780 

CGAGCGgaGCTCTTCGTAGAAGGACCGCACTGCGACTGTCTGCTCCTGCAACAGTCAGAT 

MDMCPFDTVARTSDASQLSP 280 
ATGGACATGTGTcCGTTTGATACGGTAGCGCGCACCAGCGACGCAAGTCAGCTGTCACCG 

781 + + + + + + 840 

TACCTGTACACAgGCAAACTATGCCATCGCGCGTGGTCGCTGCGTTCAGTCGACAGTGGC 

FCQLFTHNEWKKYDYLQSLG 300 
TTCTGTCAACTCTTCACTCACAATGAGTGGAAGAAGTACgACTACCTTCAGTCCTTGGGC 

841 + + + + + + 900 

AAGACAGTTGAGAAGTGAGTGTTACTCACCTTCTTCATGCTGATGGAAGTCAGGAACCCG 

KYYGYGAGNPLGPAQGIGFT 320 
AAGTACTACGGCTACGGCGCAGGCAACCGTCTGGGACCGGCTCAGGGGATAGGGTTCACC 

901 + + + + + + 960 

TTCATGATGCCGATGCCGCGTCCGTTGGGAGACCCTGGCCGAGTCCCCTATCCCAAGTGG 

NELIARLTRSPVQDHTSTNS 340 
AACGAGCTGATTGCCCGGTTGACgCGTTCGCCAGTGCAGGACCACACCAGCACTAACTCG 

961 + + + + + + 1020 

TTGCTCGACTAACGGGCCAACTGcGCAAGCGGTCACGTCCTGGTGTGGTCGTGATTGAGC 

TLVSNPATFPLNATMYVDFS 360 
ACTCTAGTCTCCAACCCGGCCACCTTCCCGTTGAACGCTACCATGTACGTCGACTTTTCA 

1021 + + + + + + 1080 

TGAGATCAGAGGTTGGGCCGGTGGAAGGGCAACTTGCGATGGTACATGCAGCTGAAAAGT 

HDNSMVS I FFALGLYNGTEP 380 
CACGACAACAGCATGGTTTCCATCTTCTTTGCATTGGGCCTGTACAACGGCACTGAACCC 

1081 + + + + + + H40 

GTGCTGTTGTCGTAC CAAAGGTAGAAGAAACGTAACCCGGACATGTTGC CGTGACTTGGG 

LSRTSVESAKELDGYSASWV 400 
TTGTCCCGGACCTCGGTGGAAAGCGCCAAGGAATTGGATGGGTATTCTGCATCCTGGGTG 

1141 + + + + + + 1200 

AACAGGGCCTGGAGCCACCTTTCGCGGTTCCTTAACCTACCCATAAGACGTAGGACCCAC 

VPFGARAYFETMQCKSEKEP 420 
GTGCCTTTCGGCGCGCGAGCCTACTTCGAGACGATGCAATGCAAGTCGGAAAAGGAGCCT 

1201 + + + + + + 1260 

CACGGAAAGCCGCGCGCTCGGATGAAGCTCTGCTACGTTACGTTCAGCCTTTTCCTCGGA 

LVRAL INDRVVPLHGCDVDK 440 
CTTGTTCGCGCTTTGATTAATGACCGGGTTGTGCCACTGCATGGCTGCGATGTGGACAAG 

1261 + + + + + + 1320 

GAACAAGCGCGAAACTAATTACTGGCCCAACACGGTGACGTACCGACGCTACACCTGTTC 



Fig. 9b 
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LGRCKLNDFVKGLSWARSGG 460 
CTGGGGCGATGCAAGCTGAATGACTTTGTCAAGGGATTGAGTTGGGCCAGATCTGGGGGC 

1321 + + + + + + 1380 

GACCCCGCTACGTTCGACTTACTGAAACAGTTCCCTAACTCAACCCGGTCTAGACCCCCG 

NWGECFS* 467 
AACTGGGGAGAGTGCTTTAGTTGA 

1381 + + 1404 

TTGACCCCTCTCACGAAATCAACT 
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CP-1 

Eco RI MGVFVVLLS IATLFGST 
TATATGAarrCATGGGCGTGTTCGTCGTGCTACTGTCCATTGCCACCTTGTTCGGTTCCA 

ATATACTTAAGTACCCGCACAAGCAGCACGATGACAGGTAACGGTGGAACAAGCCAAGGT 



SGTALGPRGNSHSCDTVDGG 
CATCCGGTACCGCCTTGGGTCCTCGTGGTAATTCTCACTCTTGTGACACTGTTGACGGTG 

61 + + + + + + 120 

GTAGGCCATGGCGGAACCCAGGAGCACCATTAAGAGTGAGAACACTGTGACAACTGCCAC 

CP-2 

CP-3 

YQCFPEISHLWGQYSPYFSL 
GTTACCAATGTTTCCCAGAAATTTCTCACTTGTGGGGTCAATACTCTCCATACTTCTCTT 

121 + + + + + + 180 

CAATGGTTACAAAGGGTCTTTAAAGAGTGAACACCCCAGTTATGAGAGGTATGAAGAGAA 



EDESAISPDVPDDCRVTFVQ 
TGGAAGACGAATCTGCTATTTCTCCAGACGTTCCAGACGACTGTAGAGTTACTTTCGTTC 

181 + + + + + + 240 

ACCTTCTGCTTAGACGATAAAGAGGTCTGCAAGGTCTGCTGACATCTCAATGAAAGCAAG 

CP-4.7 

CP-5.7 

VLSRHGARYPTDSKGKKYSA 
AAGTTTTGTCTAGACACGGTGCTAGATACCCAACTgacTCTAAGggtAAGaagTACTCTG 

241 + + + + + + 300 

TTCAAAACAGATCTGTGCCACGATCTATGGGTTGAc tgAGATTCc c aTTCtt c ATGAGAC 

L I EAI QKNATAFKGKYAFL K 
CTTTGATTGAAGCTATTCAAAAGAACGCTACTGCTTTCAAGGGTAAGTACGCTTTCTTGA 

301 -I- + + + + + 360 

GAAACTAACTTCGATAAGTTTTCTTGCGATGACGAAAGTTCCCATTCATGCGAAAGAACT 

CP-6 

CP-7 

TYNYTLGADDLTPFGENQMV 
AGACTTACAACTACACTTTGGGTGCTGACGACTTGACTCCATTCGGTGAAAACCAAATGG 

361 + + + + + + 420 

TCTGAATGTTGATGTGAAACCCACGACTGCTGAACTGAGGTAAGCCACTTTTGGTTTACC 

NSGI KFYRRYKALARKIVPF 
TTAACTCTGGTATTAAGTTCTACAGAAGATACAAGGCTTTGGCTAGAAAGATTGTTCCAT 

421 + + + + + + 480 

AATTGAGACCATAATTCAAGATGTCTTCTATGTTCCGAAACCGATCTTTCTAACAAGGTA 

CP-8.7 

CP-9 

IRASGSSRVIASAEKFIEGF 
TCATTAGAGCTTCTGGTTCT t c t AGAGTTATTGCTTCTGCTGAAAAGTTCATTGAAGGTT 

481 + + + + + + 540 

AGTAATCTCG2UVGACCAAGAagaTCTCAATAACGAAGACGACTTTTCAAGTAACTTCCAA 

QSAKLADPGSQPHQAS PVID 
TCCAATCTGCTAAGTTGGCTGACCCAGGTTCTCAACCACACCAAGCTTCTCCAGTTATTG 

541 + + + + + + 600 

AGGTTAGACGATTCAACCGACTGGGTCCAAGAGTTGGTGTGGTTCGAAGAGGTCAATAAC 



Fig. 10a 
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CP-10.7 

CP-11.7 

VI ISEASSYNNTLDPGTCTA 
ACGTTATTATTtctGAcgctTCTtctTACAACAACACTTTGGACccaGGTACTTGTACTG 

601 + + + + + + 660 

TGCAATAATAAagaCTgcgaAGGagaATGTTGTTGTGAAACCTGggtCCATGAACATGAC 

FEDSELADTVEANFTALFAP 
CTTTCGAAGACTCTGAATTGgctGACactGTTGAAGCTAACTTCACTGCTTTGTTCGCTC 

661 + + + + + + 720 

GAAAGCTTCTGAGACTTAACcgaCTGtgaCAACTTCGATTGAAGTGACGAAACAAGCGAG 

CP-12 .7 

AIRARLEADLPGVTLTDTEV 
CAGCTATTAGAGCTAGATTGGAAGCTGACTTGCCAGGTGTTACTTTGACTGACactgaaG 

721 + + + + + + 780 

GTCGATAATCTCGATCTAACCTTCGACTGAACGGTCCACAATGAAACTGACTGtgacttC 

CP-13 .7 

TYLMDMCSFETVARTSDATE 
TTac tTACTTGATGGACATGTGT t C tTTCGAAACTGTTGCTAGAACTTCTGACGCTACTG 

781 + + + + + + 840 

AAtgaATGAACTACCTGTAGACAagaAAGCTTTGACAACGATCTTGAAGACTGCGATGAC 

LSPFCALFTHDEWRHYDYLQ 
AATTGTCTCCATTCTGTGCTTTGTTCACTCACGACGAATGGAGAcacTACGACTACTTGC 

841 + + + + + + 900 

TTAACAGAGGTAAGACACGAAACAAGTGAGTGCTGCTTACCTCTgtgATGCTGATGAACG 

CP-14.7 

CP-15 .7 

SLKKYYGHGAGNPLGPTQGV 
AATCTTTGaagAAGTACTACGGTc acGGTGCTGGTAACCCATTGGGTCCAac tCAAGGTG 

901 + + + + + + 960 

TTAGAAACttcTTCATGATGCCAgtgCCACGACCATTGGGTAACCCAGGTtgaGTTCCAC 

GFANELIARLTRSPVQDHTS 
TTGGTTTCGCTAACGAATTGATTGCTAGATTGACTAGATCTCCAGTTCAAGACCACACTT 

961 + + + + + + 1020 

AACCAAAGCGATTGCTTAACTAACGATCTAACTGATCTAGAGGTCAAGTTCTGGTGTGAA 

CP-16 

CP-17.7 

TNHTLDSNPATFPLNATLYA 
CTACTAACCACACTTTGGACTCTAACCCAGCTACTTTCCCATTGAACGCTACTTTGTACG 

1021 + + + + + + 1080 

GATGATTGGTGTGAAACCTGAGATTGGGTCGATGAAAGGGTAACTTGCGATGAAACATGC 

DFSHDMGIISIFFALGLYNG 

CTGACTTCTCTCACGACAACggtattATTTCTATTTTCTTCGCTTTGGGTTTGTACAACG 

1081 + + + + + + 1140 

GACTGAAGAGAGTGCTGTTGc c at a aTAAAGATAAAAGAAGCGAAACCCAAACATGTTGC 

CP-18.7 

CP-19.7 

TAPLSTTSVES IEETDGYSS 

GTACTGCTCCATTGTCTACTACTTCTGTTGAATCTATTGAAGAAACTGACGGTTACTCTt 

1141 + + + + + + 120Q 

CATGACGAGGTAACAGATGATGAAGACAACTTAGATAACTTCTTTGACTGCCAATGAGAa 



Fig. 10b 
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AWTVPFASRAYVEMMQCQAE 
c tgc tTGGACTGTTCCATTCgc 1 1 c t AGAGCTTACGTTGAAATGATGCAATGTCAAGCTG 

1201 + + + + + + 1260 

gacgaACCTGACAAGGTAAGcgaagaTCTCGAATGCAACTTTACTACGTTACAGTTCGAC 

CP-20 

CP-21 

KEPLVRVLVNDRVVPLHGCA 
AAAAGGAACCATTGGTTAGAGTTTTGGTTAACGACAGAGTTGTTCCATTGCACGGTTGTG 

1261 + + + + + + 1320 

TTTTCCTTGGTAACCAATCTCAAAACCAATTGCTGTCTCAACAAGGTAACGTGCCAACAC 

VDKLGRCKRDDFVEGLSFAR 
CTGTTGACAAGTTGGGTAGATGTAAGAGAGACGACTTCGTTGAAGGTTTGTCTTTCGCTA 

1321 + + + + + + 1380 

GACAACTGTTCAACCCATCTACATTCTCTCTGCTGAAGCAACTTCCAAACAGAAAGCGAT 

CP-22 

SGGNWAECFA* ECO RI 
GATCTGGTGGTAACTGGGCTGAATGTTTCGCTTAAGAATTCATATA 

1381 + + + + 1426 

CTAGACCACCATTGACCCGACTTACAAAGCGAATTCTTAAGTATAT 



Fig. 10c 



34/56 
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1 MGVFWLLSI ATLFGSTSGT ALGPRGNSHS CDTVDGGYQC FPEISSNWSP 

51 YSPYFSLADE SAISPDVPKG CRVTFVQVLQ RHGARFPTSG AATRI SALIE 

101 AIQKNATAFK GKYAFLKTYN YTLGADDLVP FG AN Q SSQA G I KF YRRYKAL 

151 ARKIVPFIRA SGSDRVIDSA TNW IEGFQSA KL AD PGANPH QASPVINVII 

201 PEGAGYNNTL DHGLCTAFEE SELGDDVEAN FTAVFAPPIR ARLEAHLPGV 

251 NLTDEDWNL MDMCPFDTVA RTSDATELSP FCDLFTHDEW IQYDYLGDLD 

301 KYYGTGAGNP LGPAQGVGFV NELIARLTHS PVQDHTSTNH TLDSNPATFP 

3 51 LNATLYADFS HDNTMVAI FF ALGLYNGTKP LSTTSVESIE ETDGYSASWL 

4 01 VP F S ARM YVE MMQCEAEKEP LVRVLVNDRV VPLHGCGVDK LGRCKRDDFV 
451 EGLSFARSGG NWEECFA 



Fig. 21 
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ATGGGCGTGTTCGTCGTGCTACTGTCCATTGCCACCTTGTTCGGTTCCACATCCGGTACC 

1 + + + + + + 60 

TACCCGCACAAGCAGCACGATGACAGGTAACGGTGGAACAAGCCAAGGTGTAGGCCATGG 

MGVFVVLLS IATLFGSTSGT 

GCCTTGGGTCCTCGTGGTAATTCTCACTCTTGTGACACTGTTGACGGTGGTTACCAATGT 

61 + + + + + + 120 

CGGAACCCAGGAGCACCATTAAGAGTGAGAACACTGTGACAACTGCCACCAATGGTTACA 

ALGPRGNSHSCDTVDGGYQC 

TTCCCAGAAATTTCTCACTTGTGGGGTACCTACTCTCCATACTTCTCTTTGGCAGACGAA 

121 + + + + + + 180 

AAGGGTCTTTAAAGAGTGAACACCCCATGGATGAGAGGTATGAAGAGAAACCGTCTGCTT 

FPEISHLWGTYSPYFSLADE 

TCTGCTATTTCTCCAGACGTCCCAAAGGACTGTAGAGTTACTTTCGTTCAAGTTTTGTCT 

181 + -f + + + + 240 

AGACGATAAAGAGGTCTGCAGGGTTTCCTGACATCTCAATGAAAGCAAGTTCAAAACAGA 

SAISPDVPKDCRVTFVQVLS 

AGACACGGTGCTAGATACCCAACTTCTTCTAAGTCTAAGGCTTACTCTGCTTTGATTGAA 

241 + + + + + + 300 

TCTGTGCCACGATCTATGGGTTGAAGAAGATTCAGATTCCGAATGAGACGAAACTAACTT 

RHGARYPTSSKSKAYSALIE 

GCTATTCAAAAGAACGCTACTGCTTTCAAGGGTAAGTACGCTTTCTTGAAGACTTACAAC 
301 + + + + + + 360 

CGATAAGTTTTCTTGCGATGACGAAAGTTCCCATTCATGCGAAAGAACTTCTGAATGTTG 

AIQKNATAFKGKYAFLKTYN 

TACACTTTGGGTGCTGACGACTTGACTCCATTCGGTGAAAACCAAATGGTTAACTCTGGT 
361 + + + + + + 420 

ATGTGAAACCCACGACTGCTGAACTGAGGTAAGCCACTTTTGGTTTACCAATTGAGACCA 

YTLGADDLTPFGENQMVNSG 

ATTAAGTTCTACAGAAGATACAAGGCTTTGGCTAGAAAGATTGTTCCATTCATTAGAGCT 

421 + + + + + + 480 

TAATTCAAGATGTCTTCTATGTTCCGAAACCGATCTTTCTAACAAGGTAAGTAATCTCGA 

I KFYRRYKALARKIVPF IRA 

TCTGGTTCTGACAGAGTTATTGCTTCTGCTGAAAAGTTCATTGAAGGTTTCCAATCTGCT 



AGAC CAAGACTGTCTCAATAACGAAGACGACTTTTCAAGTAACTTC CAAAGGTTAGACGA 
SGSDRVIASAEKFIEGFQSA 
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AAGTTGGCTGACCCAGGTTCTCAACCACACCAAGCTTCTCCAGTTATTAACGTGATCATT 



541 



+ 



+ 



+ 600 



TTCAACCGACTGGGTCCAAGAGTTGGTGTGGTTCGAAGAGGTCAATAATTGCACTAGTAA 

KLADPGSQPHQASPVINVII 

CCAGAAGGATCCGGTTACAACAACACTTTGGACCATGGTCTTTGTACTGCTTTCGAAGAC 

601 + + + + + + 660 

GGTCTTCCTAGGCCAATGTTGTTGTGAAACCTGGTACCAGAAACATGACGAAAGCTTGTG 

PEGSGYNNTLDHGLCTAFED 

TCTACCCTAGGTGACGACGTTGAAGCTAACTTCACTGCTTTGTTCGCTCCAGCTATTAGA 

661 + + + + + + 720 

AGATGGGATCCACTGCTGCAACTTCGATTGAAGTGACGAAACAAGCGAGGTCGATAATCT 

STLGDDVEANFTALFAPAIR 

GCTAGATTGGAAGCTGACTTGCCAGGTGTTACTTTGACTGACGAAGACGTTGTTTACTTG 

721 + + + + + + 780 

CGATCTAACCTTCGACTGAACGGTCCACAATGAAACTGACTGCTTCTGCAACAAATGAAC 

ARLEADLPGVTLTDEDVVYL 

ATGGACATGTGTCCATTCGACACTGTCGCTAGAACTTCTGACGCTACTGAATTGTCTCCA 

781 + + + + + + 840 

TACCTGTACACAGGTAAGCTGTGACAGCGATCTTGAAGACTGCGATGACTTAACAGAGGT 

MDMCPFDTVARTSDATELSP 

TTCTGTGCTTTGTTCACTCACGACGAATGGATCCAATACGACTACTTGCAAAGCTTGGGT 

841 + + + + + + 900 

AAGACACGAAACAAGTGAGTGCTGCTTACCTAGGTTATGCTGATGAACGTTTCGAACCCA 

FCALFTHDEWIQYDYLQSLG 

AAGTACTACGGTTACGGTGCTGGTAACCCATTGGGTCCAGCTCAAGGTGTTGGTTTCGCT 

901 + + + + + + 960 

TTCATGATGCCAATGCCACGACCATTGGGTAACCCAGGTCGAGTTCCACAACCAAAGCGA 

KYYGYGAGNPLGPAQGVGFA 

AACGAATTGATTGCTAGATTGACTCACTCTCCAGTTCAAGACCACACTTCTACTAACCAC 
961 + + + + + + 1020 

TTGCTTAACTAACGATCTAACTGAGTGAGAGGTCAAGTTCTGGTGTGAAGATGATTGGTG 

NELIARLTHSPVQDHTSTNH 

ACTTTGGACTCTAACCCAGCTACTTTCCCATTGAACGCTACTTTGTACGCTGACTTCTCT 

1021 + + + + + + 1080 

TGAAACCTGAGATTGGGTCGATGAAAGGGTAACTTGCGATGAAACATGCGACTGAAGAGA 

TLDSNPATFPLNATLYADFS 
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CACGACAACACTATGATATCTATTTTCTTCGCTTTGGGTTTGTACAACGGTACCAAGCCA 
1081 + + + + + + 1140 

GTGCTGTTGTGATACTATAGATAAAAGAAGCGAAACCCAAACATGTTGCCATGGTTCGGT 
HDNTM I S I FFALGLYNGTKP 

TTGTCTACTACTTCTGTTGAATCTATTGAAGAAACTGACGGTTACTCTGCTTCTTGGACT 

1141 -f + + + + + 1200 

AACAGATGATGAAGACAACTTAGATAACTTCTTTGACTGCCAATGAGACGAAGAACCTGA 

LSTTSVES IEETDGYSASWT 

GTTCCATTCGCTGCTAGAGCTTACGTTGAAATGATGCAATGTCAAGCTGAAAAGGAACCA 
1201 + + + + + + 1260 

CAAGGTAAGCGACGATCTCGAATGCAACTTTACTACGTTACAGTTCGACTTTTCCTTGGT 

VPFAARAYVEMMQCQAEKEP 

TTGGTTAGAGTTTTGGTTAACGACAGAGTTGTTCCATTGCACGGTTGTGCTGTTGACAAG 

1261 + + + + + + 1320 

AACCAATCTCAAAACCAATTGCTGTCTCAACAAGGTAACGTGCCAACACGACAACTGTTC 

LVRVLVWDRVVPLHGCAVDK 

TTGGGTAGATGTAAGAGAGACGACTTCGTTGAAGGTTTGTCTTTCGCTAGATCTGGTGGT 
1321 + + + + + + 1380 

AACCCATCTACATTCTCTCTGCTGAAGCAACTTCCAAACAGAAAGCGATCTAGACCACCA 

LGRCKRDDFVEGLSFARSGG 

AACTGGGCTGAATGTTTCGCTTAA 

1381 + + 1404 

TTGACCCGACTTACAAAGCGAATT 

NWAECFA* 



Fig. 22c 
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ATGGGCGTGTTCGTCGTGCTACTGTCCATTGCCACCTTGTTCGGTTCCACATCCGGTACC 

1 + + + + + + 60 

TACCCGCACAAGCAGCACGATGACAGGTAACGGTGGAACAAGCCAAGGTGTAGGCCATGG 

MGVFVVLLS IATLFGSTSGT 

GCCTTGGGTCCTCGTGGTAATTCTCACTCTTGTGACACTGTTGACGGTGGTTACCAATGT 
61 + + + + + + 120 

CGGAACCCAGGAGCACCATTAAGAGTGAGAACACTGTGACAACTGCCACCAATGGTTACA 

ALGPRGNSHSCDTVDGGYQC 

TTCCCAGAAATTTCTCACTTGTGGGGTACCTACTCTCCATACTTCTCTTTGGCAGACGAA 

121 + + + + + + 180 

AAGGGTCTTTAAAGAGTGAACACC C CATGGATGAGAGGTATGAAGAGAAACCGTCTGCTT 

FPEISHLWGTYSPYFSLADE 

TCTGCTATTTCTCCAGACGTCCCAAAGGACTGTAGAGTTACTTTCGTTCAAGTTTTGTCT 

181 + + + + + + 240 

AGACGATAAAGAGGTCTGCAGGGTTTCCTGACATCTCAATGAAAGCAAGTTCAAAACAGA 

SAISPDVPKDCRVTFVQVLS 

AGACACGGTGCTAGATACCCAACTTCTTCTGCGTCTAAGGCTTACTCTGCTTTGATTGAA 

241 + + + + + + 300 

TCTGTGCCACGATCTATGGGTTGAAGAAGACGCAGATTCCGAATGAGACGAAACTAACTT 

RHGARYPTSSASKAYSALIE 

GCTATTCAAAAGAACGCTACTGCTTTCAAGGGTAAGTACGCTTTCTTGAAGACTTACAAC 
301 + + + + + + 360 

CGATAAGTTTTCTTGCGATGACGAAAGTTCCCATTCATGCGAAAGAACTTCTGAATGTTG 

AIQKNATAFKGKYAFLKTYN 

TACACTTTGGGTGCTGACGACTTGACTCCATTCGGTGAAAACCAAATGGTTAACTCTGGT 

361 + + + + + + 420 

ATGTGAAACCCACGACTGCTGAACTGAGGTAAGCCACTTTTGGTTTACCAATTGAGACCA 

YTLGADDLTPFGENQMVNSG 

ATTAAGTTCTACAGAAGATACAAGGCTTTGGCTAGAAAGATTGTTCCATTCATTAGAGCT 

421 + + + + + + 480 

TAATTCAAGATGTCTTCTATGTTCCGAAACCGATCTTTCTAACAAGGTAAGTAATCTCGA 

IKFYRRYKALARKIVPFIRA 

TCTGGTTCTGACAGAGTTATTGCTTCTGCTGAAAAGTTCATTGAAGGTTTCCAATCTGCT 
481 + + + + + + 54Q 

AGACCAAGACTGTCTCAATAACGAAGACGACTTTTCAAGTAACTTCCAAAGGTTAGACGA 
SGSDRVIASAEKFIEGFQSA 



Fig. 23a 
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AAGTTGGCTGACCCAGGTTCTCAACCACACCAAGCTTCTCCAGTTATTAACGTGATCATT 



541 



+ 



+ 



+ 



+ 



+ 



+ 600 



TTCAACCGACTGGGTCCAAGAGTTGGTGTGGTTCGAAGAGGTCAATAATTGCACTAGTAA 

KLADPGSQPHQASPVINVII 

CCAGAAGGATCCGGTTACAACAACACTTTGGACCATGGTCTTTGTACTGCTTTCGAAGAC 

601 + + + + + + 660 

GGTCTTCCTAGGCCAATGTTGTTGTGAAAGCTGGTACCAGAAACATGACGAAAGCTTCTG 

PEGSGYNNTLDHGLCTAFED 

TCTACCCTAGGTGACGACGTTGAAGCTAACTTCACTGCTTTGTTCGCTCCAGCTATTAGA 

661 + + + + + + 720 

AGATGGGATCCACTGCTGCAACTTCGATTGAAGTGACGAAACAAGCGAGGTCGATAATCT 

STLGDDVEANFTALFAPAIR 

GCTAGATTGGAAGCTGACTTGCCAGGTGTTACTTTGACTGACGAAGACGTTGTTTACTTG 

721 + + + + + + 780 

CGATCTAACCTTCGACTGAACGGTCCACAATGAAACTGACTGCTTCTGCAACAAATGAAC 

ARLEADLPGVTLTDEDVVYL 

ATGGACATGTGTCCATTCGACACTGTCGCTAGAACTTCTGACGCTACTGAATTGTCTCCA 

781 + + + + + + 840 

TACCTGTACACAGGTAAGCTGTGACAGCGATCTTGAAGACTGCGATGACTTAACAGAGGT 

MDMCPFDTVARTSDATELSP 

TTCTGTGCTTTGTTCACTCACGACGAATGGATCCAATACGACTACTTGCAAAGCTTGGGT 

841 + + + + + + 900 

AAGACACGAAACAAGTGAGTGCTGCTTACCTAGGTTATGCTGATGAACGTTTCGAACCCA 

FCALFTHDEWIQYDYLQSLG 

AAGTACTACGGTTACGGTGCTGGTAACCCATTGGGTCCAGCTCAAGGTGTTGGTTTCGCT 

901 + + + + + + 960 

TTCATGATGCCAATGCCACGACCATTGGGTAACC CAGGTCGAGTTCCACAAC CAAAGCGA 

KYYGYGAGNPLGPAQGVGFA 

AACGAATTGATTGCTAGATTGACTCACTCTCCAGTTCAAGACCACACTTCTACTAACCAC 

961 + + + + + + 1020 

TTGCTTAACTAACGATCTAACTGAGTGAGAGGTCAAGTTCTGGTGTGAAGATGATTGGTG 

NELIARLTHSPVQDHTSTNH 

ACTTTGGACTCTAACCCAGCTACTTTCCCATTGAACGCTACTTTGTACGCTGACTTCTCT 

1021 + + + + + + 1080 

TGAAACCTGAGATTGGGTCGATGAAAGGGTAACTTGCGATGAAACATGCGACTGAAGAGA 

TLDSNPATFPLNATLYADFS 




23b 
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CACGACAACACTATGATATCTATTTTCTTCGCTTTGGGTTTGTACAACGGTACCAAGCCA 

1081 + + + + + + 1140 

GTGCTGTTGTGATACTATAGATAAAAGAAGCGAAACCCAAACATGTTGCCATGGTTCGGT 

HDNTMIS IFFALGLYNGTKP 

TTGTCTACTACTTCTGTTGAATCTATTGAAGAAACTGACGGTTACTCTGCTTCTTGGACT 

1141 + + + + + + 1200 

AACAGATGATGAAGACAACTTAGATAACTTCTTTGACTGCCAATGAGACGAAGAACCTGA 

LSTTSVES IEETDGYSASWT 

GTTCCATTCGCTGCTAGAGCTTACGTTGAAATGATGCAATGTCAAGCTGAAAAGGAACCA 

1201 + + + + + + 1260 

CAAGGTAAGCGACGATCTCGAATGCAACTTTACTACGTTACAGTTCGACTTTTCCTTGGT 

VPFAARAYVEMMQCQAEKEP 

TTGGTTAGAGTTTTGGTTAACGACAGAGTTGTTCCATTGCACGGTTGTGCTGTTGACAAG 

1261 + + + + + + 1320 

AACCAATCTCAAAACCAATTGCTGTCTCAACAAGGTAACGTGCCAACACGACAACTGTTC 

LVRVLVKTDRVVPLHGCAVDK 

TTGGGTAGATGTAAGAGAGACGACTTCGTTGAAGGTTTGTCTTTCGCTAGATCTGGTGGT 

1321 + + + + + + 1380 

AACCCATCTACATTCTCTCTGCTGAAGCAACTTCCAAACAGAAAGCGATCTAGACCACCA 

LGRCKRDDFVEGLSFARSGG 

AACTGGGCTGAATGTTTCGCTTAA 

1381 + + 1404 

TTGACCCGACTTACAAAGCGAATT 

NWAECFA* 
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ATGGGCGTGTTCGTCGTGCTACTGTCCATTGCCACCTTGTTCGGTTCCACATCCGGTACC 

1 + + + + + + 60 

TACCCGCACAAGCAGCACGATGACAGGTAACGGTGGAACAAGCCAAGGTGTAGGCCATGG 

MGVFVVLLSIATLFGSTSGT 

GCCTTGGGTCCTCGTGGTAATTCTCACTCTTGTGACACTGTTGACGGTGGTTACCAATGT 

61 + + + + + + 120 

CGGAACCCAGGAGCACCATTAAGAGTGAGAACACTGTGACAACTGCCACCAATGGTTACA 

ALGPRGNSHSCDTVDGGYQC 

TTCCCAGAAATTTCTCACTTGTGGGGTACATACTCTCCATTCTTCTCTTTGGCTGACGAA 

121 + + + + + + 180 

AAGGGTCTTTAAAGAGTGAACACCCCATGTATGAGAGGTAAGAAGAGAAACCGACTGCTT 

FPEISHLWGTYSPFFSLADE 

TCTGCTATTTCTCCAGACGTTCCAAAGGGTTGTAGAGTTACTTTCGTTCAAGTTTTGTCT 

181 + + + + + + 240 

AGACGATAAAGAGGTCTGCAAGGTTTCCCAACATCTCAATGAAAGCAAGTTCAAAACAGA 

SAISPDVPKGCRVTFVQVLS 

AGACACGGTGCTAGATACCCAACTTCTTCTAAGTCTAAGGCTTACTCTGCTTTGATTGAA 

241 + + + + + + 300 

TCTGTGCCACGATCTATGGGTTGAAGAAGATTCAGATTCCGAATGAGACGAAACTAACTT 

RHGARYPTSSKSKAYSALIE 

GCTATTCAAAAGAACGCTACTGCTTTCAAGGGTAAGTACGCTTTCTTGAAGACTTACAAT 

301 + + + + + + 360 

CGATAAGTTTTCTTGCGATGACGAAAGTTCCCATTCATGCGAAAGAACTTCTGAATGTTA 

AIQKNATAFKGKYAFLKTYN 

TACACTTTGGGTGCTGACGACTTGACTCCATTCGGTGAACAACAAATGGTTAACTCTGGT 

361 + + + + + + 420 

ATGTGAAACCCACGACTGCTGAACTGAGGTAAGCCACTTGTTGTTTACCAATTGAGACCA 

YTLGADDLTPFGEQQMVNSG 

ATTAAGTTCTACAGAAGATACAAGGCTTTGGCTAGAAAGATTGTTCCATTCATTAGAGCT 

421 + + + + + + 480 

TAATTCAAGATGTCTTCTATGTTCCGAAACCGATCTTTCTAACAAGGTAAGTAATCTCGA 

I KFYRRYKALARKIVPF I RA 

TCTGGTTCTGACAGAGTTATTGCTTCTGCCGAAAAGTTCATTGAAGGTTTCCAATCTGCT 

481 + + + + + + 540 

AGACCAAGACTGTCTCAATAACGAAGACGGCTTTTCAAGTAACTTCCAAAGGTTAGACGA 

SGSDRVIASAEKFIEGFQSA 



Fig. 24a 
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AAGTTGGCTGACCCAGGTGCTAACCCACACCAAGCTTCTCCAGTTATTAACGTTATTATT 

541 + + + + + + 600 

TTCAACCGACTGGGTCCACGATTGGGTGTGGTTCGAAGAGGTCAATAATTGCAATAATAA 

KLADPGANPHQASPVINVI I 

CCAGAAGGTGCTGGTTACAACAACACTTTGGACCACGGTTTGTGTACTGCTTTCGAAGAA 

601 + + + + + + 660 

GGTCTTCCACGACCAATGTTGTTGTGAAACCTGGTGCCAAACACATGACGAAAGCTTCTT 

PEGAGYNNTLDHGLCTAFEE 

TCTACCCTAGGTGACGACGTTGAAGCTAACTTCACTGCTGTTTTCGCTCCACCAATTAGA 

661 + + + + + + 720 

AGATGGGATCCACTGCTGCAACTTCGATTGAAGTGACGACAAAAGCGAGGTGGTTAATCT 

STLGDDVEANFTAVFAPPIR 

GCTAGATTGGAAGCTCACTTGCCAGGTGTTAACTTGACTGACGAAGACGTTGTTAACTTG 

721 + + + + + + 780 

CGATCTAACCTTCGAGTGAACGGTCCACAATTGAACTGACTGCTTCTGCAACAATTGAAC 

ARLEAHLPGVNLTDEDVVNL 

ATGGACATGTGTCCATTCGACACTGTTGCTAGAACTTCTGACGCTACTCAATTGTCTCCA 

781 + + + + + + 840 

TACCTGTACACAGGTAAGCTGTGACAACGATCTTGAAGACTGCGATGAGTTAACAGAGGT 

MDMCPFDTVARTSDATQLSP 

TTCTGTGACTTGTTCACTCACGACGAATGGATTCAATACGACTACTTGCAATCTTTGGGT 

841 + + + + + + 900 

AAGACACTGAACAAGTGAGTGCTGCTTACCTAAGTTATGCTGATGAACGTTAGAAACCCA 

FCDLFTHDEWIQYDYLQSLG 

AAGTACTACGGTTACGGTGCTGGTAACCCATTGGGTCCAGCTCAAGGTGTTGGTTTCGTT 

901 + + + + + + 960 

TTCATGATGC CAATGCCACGACCATTGGGTAACC CAGGTCGAGTTC CACAACCAAAGCAA 

KYYGYGAGNPLGPAQGVGFV 

AACGAATTGATTGCTAGATTGACTCACTCTCCAGTTCAAGACCACACTTCTACTAACCAC 

961 + + + + + + 1020 

TTGCTTAACTAAGGATCTAACTGAGTGAGAGGTCAAGTTCTGGTGTGAAGATGATTGGTG 

NELIARLTHSPVQDHTSTNH 

ACTTTGGACTCTAACCCAGCTACTTTCCCATTGAACGCTACTTTGTACGCTGACTTCTCT 

1021 + + + + + + 1080 

TGAAACCTGAGATTGGGTCGATGAAAGGGTAACTTGCGATGAAACATGCGACTGAAGAGA 

TLDSNPATFPLNATLYADFS 



Fig. 24b 
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CACGACAACACTATGGTTTCTATTTTCTTCGCTTTGGGTTTGTACAACGGTACTAAGCCA 

1081 + + + + + + 1140 

GTGCTGTTGTGATACCAAAGATAAAAGAAGCGAAACCCAAACATGTTGCCATGATTCGGT 

HDNTMVS I FFALGLYNGTKP 

TTGTCTACTACTTCTGTTGAATCTATTGAAGAAACTGACGGTTACTCTGCTTCTTGGACT 

1141 + + + + + + 1200 

AACAGATGATGAAGACAACTTAGATAACTTCTTTGACTGCCAATGAGACGAAGAACCTGA 

LSTTSVES IEETDGYSASWT 

GTTCCATTCGCTGCTAGAGCTTACGTTGAAATGATGCAATGTGAAGCTGAAAAGGAACCA 

1201 + + + + + + 1260 

CAAGGTAAGCGACGATCTCGAATGCAACTTTACTACGTTACACTTCGACTTTTCCTTGGT 

VPFAARAYVEMMQCEAEKEP 

TTGGTTAGAGTTTTGGTTAACGACAGAGTTGTTCCATTGCACGGTTGTGCTGTTGACAAG 

1261 + + + + + + 1320 

AACCAATCTCAAAACCAATTGCTGTCTCAACAAGGTAACGTGCCAACACGACAACTGTTC 

LVRVLVNDRVVPLHGCAVDK 

TTGGGTAGATGTAAGAGAGACGACTTCGTTGAAGGTTTGTCTTTCGCTAGATCTGGTGGT 

1321 + + + + + + 1380 

AACCCATCTACATTCTCTCTGCTGAAGCAACTTCCAAACAGAAAGCGATCTAGACCACCA 

LGRCKRDDFVEGLSFARSGG 

AACTGGGAAGAATGTTTCGCTTAA 

1381 + + 1404 

TTGACCCTTCTTACAAAGCGAATT 

NWEECFA* 
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ATGGGCGTGTTCGTCGTGCTACTGTCCATTGCCACCTTGTTCGGTTCCACATCCGGTACC 

1 + + + + + + 60 

TACCCGCACAAGCAGCACGATGACAGGTAACGGTGGAACAAGCCAAGGTGTAGGCCATGG 

MGVFVVLLS IATLFGSTSGT 

GCCTTGGGTCCTCGTGGTAATTCTCACTCTTGTGACACTGTTGACGGTGGTTACCAATGT 

61 + + + + + + 120 

CGGAACCCAGGAGCACCATTAAGAGTGAGAACACTGTGACAACTGCCACCAATGGTTACA 

ALGPRGNSHSCDTVDGGYQC 

TTCCCAGAAATTTCTCACTTGTGGGGTACATACTCTCCATTCTTCTCTTTGGCTGACGAA 

121 + + + + + + 180 

AAGGGTCTTTAAAGAGTGAACACCCCATGTATGAGAGGTAAGAAGAGAAACCGACTGCTT 

FPEISHLWGTYSPFFSLADE 

TCTGCTATTTCTCCAGACGTTCCAAAGGGTTGTAGAGTTACTTTCGTTCAAGTTTTGTCT 

181 + + + + + + 240 

AGACGATAAAGAGGTCTGCAAGGTTTCCCAACATCTCAATGAAAGCAAGTTCAAAACAGA 

SAISPDVPKGCRVTFVQVLS 

AGACACGGTGCTAGATACCCAACTTCTTCTGCGTCTAAGGCTTACTCTGCTTTGATTGAA 

241 + + + + + + 300 

TCTGTGCCACGATCTATGGGTTGAAGAAGACGCAGATTCCGAATGAGACGAAACTAACTT 

RHGARYPTSSASKAYSALIE 

GCTATTCAAAAGAACGCTACTGCTTTCAAGGGTAAGTACGCTTTCTTGAAGACTTACAAT 

301 + + + + + + 360 

CGATAAGTTTTCTTGCGATGACGAAAGTTCCCATTCATGCGAAAGAACTTCTGAATGTTA 

AIQKNATAFKGKYAFLKTYN 

TACACTTTGGGTGCTGACGACTTGACTCCATTCGGTGAACAACAAATGGTTAACTCTGGT 

361 + + + + + + 420 

ATGTGAAACCCACGACTGCTGAACTGAGGTAAGCCACTTGTTGTTTACCAATTGAGACCA 

YTLGADDLTPFGEQQMVNSG 

ATTAAGTTCTACAGAAGATACAAGGCTTTGGCTAGAAAGATTGTTCCATTCATTAGAGCT 

421 + + + + + + 480 

TAATTCAAGATGTCTTCTATGTTCCGAAACCGATCTTTCTAACAAGGTAAGTAATCTCGA 

I KFYRRYKALARKIVPFIRA 

TCTGGTTCTGACAGAGTTATTGCTTCTGCCGAAAAGTTCATTGAAGGTTTCCAATCTGCT 

481 + + + + + + 540 

AGACCAAGACTGTCTCAATAACGAAGACGGCTTTTCAAGTAACTTCCAAAGGTTAGACGA 

SGSDRVIASAEKFIEGFQSA 



Fig. 25a 
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AAGTTGGCTGACCCAGGTGCTAACCCACACCAAGCTTCTCCAGTTATTAACGTTATTATT 

541 + + + + + + 600 

TTCAACCGACTGGGTCCACGATTGGGTGTGGTTCGAAGAGGTCAATAATTGCAATAATAA 

KLADPGANPHQASPVINVII 

CCAGAAGGTGCTGGTTACAACAACACTTTGGACCACGGTTTGTGTACTGCTTTCGAAGAA 

601 + + + + + + 660 

GGTCTTCCACGACCAATGTTGTTGTGAAACCTGGTGCCAAACACATGACGAAAGCTTCTT 

PEGAGYNNTLDHGLCTAFEE 

TCTACCCTAGGTGACGACGTTGAAGCTAACTTCACTGCTGTTTTCGCTCCACCAATTAGA 

661 + + + + + + 720 

AGATGGGATCCACTGCTGCAACTTCGATTGAAGTGACGACAAAAGCGAGGTGGTTAATCT 

STLGDDVEANFTAVFAPPIR 

GCTAGATTGGAAGCTCACTTGCCAGGTGTTAACTTGACTGACGAAGACGTTGTTAACTTG 

721 + + + + + + 780 

CGATCTAACCTTCGAGTGAACGGTCCACAATTGAACTGACTGCTTCTGCAACAATTGAAC 

ARLEAHLPGVNLTDEDVVNL 

ATGGACATGTGTCCATTCGACACTGTTGCTAGAACTTCTGACGCTACTCAATTGTCTCCA 

781 + + + + + + 840 

TACCTGTACACAGGTAAGCTGTGACAACGATCTTGAAGACTGCGATGAGTTAACAGAGGT 

MDMCPFDTVARTSDATQLSP 

TTCTGTGACTTGTTCACTCACGACGAATGGATTCAATACGACTACTTGCAATCTTTGGGT 

841 + + + + + + 900 

AAGACACTGAACAAGTGAGTGCTGCTTACCTAAGTTATGCTGATGAACGTTAGAAACCCA 

FCDLFTHDEWIQYDYLQSLG 

AAGTACTACGGTTACGGTGCTGGTAACCCATTGGGTCCAGCTCAAGGTGTTGGTTTCGTT 

901 + + + + + + 960 

TTCATGATGCCAATGCCACGACCATTGGGTAACCCAGGTCGAGTTCCACAACCAAAGCAA 

KYYGYGAGNPLGPAQGVGFV 

AACGAATTGATTGCTAGATTGACTCACTCTCCAGTTCAAGACCACACTTCTACTAACCAC 

961 + + + + + + 1020 

TTGCTTAACTAACGATCTAACTGAGTGAGAGGTCAAGTTCTGGTGTGAAGATGATTGGTG 

NELIARLTHSPVQDHTSTNH 

ACTTTGGACTCTAACCCAGCTACTTTCCCATTGAACGCTACTTTGTACGCTGACTTCTCT 

1021 + + + + + + 1080 

TGAAACCTGAGATTGGGTCGATGAAAGGGTAACTTGCGATGAAACATGCGACTGAAGAGA 

TLDSNPATFPLNATLYADFS 
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CACGACAACACTATGGTTTCTATTTTCTTCGCTTTGGGTTTGTACAACGGTACTAAGCCA 

1081 + + + + + + 1140 

GTGCTGTTGTGATACCAAAGATAAAAGAAGCGAAACCCAAACATGTTGCCATGATTCGGT 

HDNTMVS I FFALGLYNGTKP 

TTGTCTACTACTTCTGTTGAATCTATTGAAGAAACTGACGGTTACTCTGCTTCTTGGACT 

1141 + + + + + + 1200 

AACAGATGATGAAGACAACTTAGATAACTTCTTTGACTGCCAATGAGACGAAGAACCTGA 

LSTTSVES IEETDGYSASWT 

GTTCCATTCGCTGCTAGAGCTTACGTTGAAATGATGCAATGTGAAGCTGAAAAGGAACCA 

1201 + + + + + + 1260 

CAAGGTAAGCGACGATCTCGAATGCAACTTTACTACGTTACACTTCGACTTTTCCTTGGT 

VPFAARAYVEMMQCEAEKEP 

TTGGTTAGAGTTTTGGTTAACGACAGAGTTGTTCCATTGCACGGTTGTGCTGTTGACAAG 

1261 + + + + + + 1320 

AACCAATCTCAAAACCAATTGCTGTCTCAACAAGGTAACGTGCCAACACGACAACTGTTC 

LVRVLVNDRVVPLHGCAVDK 

TTGGGTAGATGTAAGAGAGACGACTTCGTTGAAGGTTTGTCTTTCGCTAGATCTGGTGGT 

1321 + + + + + + 1380 

AACCCATCTACATTCTCTCTGCTGAAGCAACTTCCAAACAGAAAGCGATCTAGACCACCA 

LGRCKRDDFVEGLSFARSGG 

AACTGGGAAGAATGTTTCGCTTAA 

1381 + + 1404 

TTGACCCTTCTTACAAAGCGAATT 

NWEECFA* 
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